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pPRPRTPTION 

Methods for Pheno *VP e Creation 
Tram Mult ipl a GehP Populations 


— — p^-oss Reference t o Related Application 

This is a continuation-in-part application of copend- 
ing application Serial No. 513,957, filed April 24, 1990 
which is a continuation-in-part of Serial # No. 353,235, 
5 filed May 16, 1989, and Serial No. 353,241, filed May 17, 
1989, the disclosures of which are hereby incorporated by 
reference. 

fjeld of t he Tnvention 

The present invention relates to methods for randomly 
10 combining populations of nucleotide sequences and select- 
ing those combinations coding for a desired predetermined 
phenotype. 

parttaround of the Tnvention 

The production of genetic variants, including vari- 

15 ants of both polypeptides and organisms such as bacteria 
and phage, has been a goal in the work of many individuals 
involved in recombinant DNA technologies. For example, 
researchers have beneficially relied upon random genetic 
recombination in the past for the production of new and 

20 useful microorganisms. Genetic recombination includes a 
variety of * processes that produce new linkage relation- 
ships of genes or parts of genes. Genetic recombination 
is often subdivided into general genetic recombination, 
which takes place between homologous chromosomes, more or 

25 less anywhere along their length, and recombination that 
does not require extensive homology. The latter category 
includes site-specific recombination, which depends upon 
th exist nee of specific sites in one or m re molecules 
and which includes int ractions of viral genom s and 

30 insertion sequences with chromosomes of prokaryotes and 
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euXaryotes, and less well defined instances of recombina- 
tion that appear to require neither extensive homology nor 
special sites. Variable gene expression can also result 
in production of various combinations of polypeptides, the 

_5__immune„-system— being— one— example— of— such— protein" 

combination. 

The immune system of a mammal is one of the most 
versatile biological systems? probably greater than 1.0 x 
10 7 antibody specificities can be produced. Indeed, a 

10 great deal of contemporary biological and medical research 
is directed toward tapping this repertoire. During the 
last decade, furthermore, there has been a dramatic 
increase in the ability to harness the output of the 
immune system. The development of the hybridoma method- 

15 ology by Kohier and Milstein has made it possible to 
produce monoclonal antibodies, i t e,, a composition of 
antibody molecules of single epitope specificity, from the 
repertoire of antibodies induced during an immune 
response. Monoclonal antibodies have been generated in 

20 the past from hybridomas, generated by fusing antibody- 
secreting lymphocytes with an immortal cell line, such as 
myeloma. 

Although standard hybridoma technology has been 
extremely valuable, the screening of fused cells to iden- 

25 tify hybridomas expressing useful antibody molecules is 
labor intensive/ time consuming and expensive. Moreover, 
the standard technology yields rodent antibody molecules 
that have two clear disadvantages. The first is that 
subtle variations in certain human antigenic systems, such 

30 as major histocompatibility proteins, are not easily dis- 
tinguished by non-primate antibodies. Therefore, rodent 
antibodies may not provide the repertoire of specificities 
needed to distinguish certain polymorphic antigenic deter- 
minants. In other words, current methods for generating 

35 mon clonal antibodies are not capable of efficiently 
surveying the entire antibody response induced by a 
particular imxnunogen. Thus, in an individual animal there 
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arc at least 5-10,000 different B-cell clones capable of 
generating unique antibodies to a small relatively rigid 
immunogens, such as, for example dinitrophenol . Further, 
because of the process of somatic mutation during the 
-5-generation-of_antibQd y ^diversity,, essentially an unlimited 
number of unique antibody molecules may be generated. In 
contrast to this vast potential for different antibodies, 
current hybridoma methodologies typically yield only a few 
hundred different monoclonal antibodies per fusion. A 

10 second major drawback in hybridoma technology is that 
rodent antibodies are highly immunogenic in humans, and 
can preclude their continued use in patients for 
diagnostic or therapeutic purposes. 

one alternative is to produce human cells that 

15 express antibody. Unfortunately, it is quite difficult to 
identify and produce pure human monoclonal antibodies. 
Standard methods used to immortalize antibody-producing 
cells are less than satisfactory. One approach that 
circumvents the need for human hybridoma cells has been to 

20 use recombinant DNA technology to express fusion antibody 
proteins. These molecules have amino terminal variable 
domains of the light and heavy chains derived from a 
specific rodent monoclonal antibody and the carboxy 
terminal constant region domains derived from a human 

25 antibody. The use of human constant regions diminishes 
the human anti-globulin immune response,- avoiding the 
stimulation of anti^isotypic antibody-producing B. cells. 
However, the rodent-derived variable region framework 
domains still elicit a response that is more severe than 

30 a variable domain response directed against a pure human 
antibody. 

in an effort to avoid the anti-idiotypic response 
directed against the rodent framework regions of the 
domains, some researchers have taken a human antibody an* 
replaced th hypervariable regions (CDRs) with 
hvpervariable regions from a rodent antibody specific f r 
a selected antigen. Although such antibodies may have an 
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affinity for antigen comparable to the parent rodent 
antibody, the process of grafting all rodent CDRs into a 
human immunoglobulin gene is technically challenging. 

Aside from repertoire specificity and immunogenicity , 
other drawbacks in producing monoclonal antibodies with 
the hybridoma.method^lo^^ 


1 ^ ro5 ~^ on ^ apac ^ One means 

by which the art has attempted to overcome these latter 
two problems has been to clone the immunoglobulin- 
10 producing genes from a particular hybridoma. of interest 
into a procaryotic expression system. See, for example, 
Robinson et al., PCT Publication No. WO 89/0099; Winter et 
al European Patent Publication No. 0239400; Reading, 
U.s! Patent No. 4,714,681; and Cabilly et al . , European 
15 patent Publication No. 0125023. 

The immunologic repertoire of vertebrates has 
recently been found to contain genes coding for 
immunoglobulins having catalytic activity. Tramontane et 
al sci. . 234:1566-1570 (1986); Pollack et al., g£Lu, 
20 234!l570-1573 (1986); Janda et al., Sfii.., 244:437-440 
(1989) . The presence of, or the ability to induce the 
repertoire to produce, antibody molecules capable of a 
catalyzing chemical reaction, i.e., acting like enzymes, 
had previously been postulated almost 20 years ago by W. 
25 P . Jencks in in Chemistry an* Fnzvmoloqy , 

McGraw-Hill, N.Y. (1969). 

It is believed that one reason the art failed to 
isolate catalytic antibodies , from the immunological 
repertoire earlier, and its failure to isolate many to 
30 date even after their actual discovery, is the inability 
to screen a large portion of the repertoire for the 
desired activity. Another reason is believed to be the 
bias of currently available screening techniques, such as 
the hybridoma technique, towards the production high 
affinity antibodies inherently designed for participation 
in the process of neutralization, as opposed t catalysis. 


35 
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in an attempt to enhance the designed recombination 
of desired DMA sequences or the desired combination of 
otherwise randomly generated polypeptides, including the 
identification and production of pure human monoclonal 
5 antibodies, we have pursued alternative approaches for the 

, prod uc*ion-and^creening^ S uch^^ 

polypeptides. 

grnnmarv the In vention 

The present invention is directed to methods f or pro- 

10 ducing biological agents having a desired novel phenotype 
wherein this phenotype results from expression of a 
particular combined nucleotide sequence and wherein said 
phenotype can be used to identify the biological agents 
having the particular combined nucleotide sequence and 

15 distinguish them from biological agents having other 
combined nucleotide sequences, ^e^des ired phenotype is 
typically a r ^n»t™ e__which is not norma lly expressed by ^ ^ 
^ ^are^^IeoHoTle^i ic^. in one embodiments these 
^^dTl^prise first Replicating at least portions of 

20 two parent nucleotide sequences. The replicating step 
yields a population of diverse replicas of parent nucleo- 
tide sequences. In one embodiment, each parent nucleotide 
sequence initially comprises a population (or family) of 
diverse nucleotide sequences which is replicated to give 

25 a population of diverse replicas. Alternatively, a popu- 
lation of diverse replicas is generated by replicating a 
parent nucleotide sequence under conditions which allow 
mutations to occur which generates diversity from one 
parent nucleotide sequence and results in a population of 

30 diverse replicas. In one aspect, the parent nucleotide 
sequences may comprise a single DNA molecule or alterna- 
tively the parent nucleotide sequences comprise separate 
DNA molecules. Where the parent nucleotide sequences com- 
prise one DNA molecul , after replication, the resulting 

35 populations f diverse replicas deriv d from each parent 
nucle tide sequence are separated. The populations of 
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diverse replicas are then brought together, preferably in 
a random manner, to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas. The parent 
5 nucleotide sequences may be suitably replicated and 
— — — — —brought— together— according-to— the— various— methods- 
described herein for replication and recombination of 
nucleotide sequences and generation of combinatorial 
libraries. The combined nucleotide sequences are 
10 expressed in biological agents. Such biological agents 
may comprise a host cell, or alternatively, a plasmid, 
bacteriophage 6r virus, or nucleic acid vector, and such 
suitable means for expression are described herein. In 
on e embodiment expression may constitute t he mere exist- 
15 ^ of th enucleotide seq u ences in the~iame biological 
g^t. T hen, the bi o logical agents whic h express the 
de^d phenotvpear e_identified . If desired, the pheno- 
type is used to di stinguish those biological agents 
expressing the particular combined nucleotide sequence 
20 from biological agents expressing other combined nucleo- 
tide sequences. The desired phenotyp e may comprise^ a 
^^gtldeTgore toan one polypeptide . or a multimeric 
V -J. "polypeptide, the expression of which is^ detectable. 

' Alternativel y, the phenotype may comprise synthesis of_one 

-^Mjb^L 25 or nor e RNA molecul es. Optionally, either the polypep- 

tides or RNA may exhibit enzymatic activity or receptor 
activity; or the DNA or RNA may simply act as a target for 
interaction with other molecules. 

The present invention provides novel methods for the 
30 cloning of cells having novel phenotypes. These methods 
generally include the use of a combinatorial library 
selection system to generate a diverse collection of 
clones. in one aspect, the methods utilize at least two 
starting populations of nucleotide sequences which can be 
35 recombined to form a library of clones containing nucleo- 
tide sequences from each of the parent populations. These 
nethods can be utilized, therefore, to create cells having 
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nova! phenotypes, that is, cells having . new and desired 
combination of expressed polypeptides. These methods can 
als0 be used for the production of new oombinations of 
poxypeptides. including the polypeptides utilized in the 
'option of biologically content immunoglobulin mole- 
cules. » ^ocordahce with the latter-object-of-the 
invention, these methods can be used to screen a larger 
"rtion of the immunological repertoire for receptors 
^ . Preselected activity than has heretofore been 
pcIsiL. thereby overcoming the before-mentioned 

^adequacies of ^br£oma "tf^esent invention 

In another embodiment , tne pre&e» 
contemplates a gene library comprising an isolate .admix- 
ture of at least about 10 s , preferably at least about 10 
^ more preferably at least 1.V V,-and/or VccOing DNA 
• homologs. a plurality of which share a conserved antigenic 
Lterminant. Preferably, the homologs are P""-*j»» 
medium suitable fcf 1n v(W manipulation, such as water, 
phosphate buffered saline and the liXe, which maintains 
the biological activity of the homologs. ,., viona 
in one embodiment, at least two starting populations 
of DNA sequence-containing vectors are physically combined 
% any of several techniques, including those described 
nerein, to form a library of clones 
sequences from each of the parent populations Alterna- 
tively there may be more than two gene families andtte 
vectors produced thereby may contain a random 
of one member of each gene family to create the 
.bje characteristic. These vectors can then be 
transferred to desired host cells to create in novel 
combinations cf phenotypic characteristics in the host 
cell. Methods of combining desired WK sequences include 
L use of restriction digestion and ligation, homologous 
^combination, and site-specific recombination by methods 
"eluding intergras -related proteins, flp "co-bin- 
catalyzed recombination, the sra-laa syst » of bacterio- 
phage PI, and the use of transposons. 
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In a still further embodiment, the present invention 
contemplates vectors for use in the methods which 
comprise, in addition to random DNA sequences from the 
starting gene family populations, DNA sequences which 
facilitate the region-specific, random recombi nation 
together of at least one gene from each starting gene 
family population. Sequences enabling the recombination 
of these vectors include the use of functional f lp recom- 
bination sequences, functional loxp recombination 
10 sequences, at sequences recognized by integrase-related 
proteins from lambdoid bacteriophages, and terminal repeat 
sequences recognized by transposases. Thus, the present 
invention also includes methods for the combinatorial 
generation of phenotypes, including a method of producing 
15 a nucleic acid vector encoding tvo or more desired genes 
each from a family of genes, said genes being capable of 
producing a characteristic that can be used to identify 
the vector encoding said genes from other vectors encoding 
other members of the families of genes, which method 

20 comprises: 

a) randomly inserting into vectors one member of a 
first family of genes and one member from, one or more 
other families of genes so that a population of vectors 
are created wherein each vector may contain one of the 

25 genes from said first gene family and one of the genes 
from each of said other gene families; 

b) identifying within said population of vectors a 
vector capable of detectably producing a desired charac- 
teristic resulting from the inclusion of one gene from 

30 said first gene family and one gene from each of said 
other gene families, and using said characteristic to 
distinguish the' vector from other vectors within the 
population containing undesired combinations of gene 
members from said gene families. 

Suitable vectors for use according to the methods of 
the present invention include plasmid or cosmid vectors 
or , alternatively, phage vectors. Suitable host cells for 


35 


SUBSTITUTE 


SHEET 


WO 91/16427 


PCT/US91/02910 


expressing the vectors comprise either ukaryotic cells or 
prlfcaryotic cells. Preferred euXaryotic cells include 
proitciiryuuAw , » sweet, the vectors 

mammalian cells. In one preferred aspect, 
comprise lambda bacteriophage and host cells comprise E. 

""'preferably, the genes are oo-M»d^U*Bj_ 

^j^^-^jj^^^StSiaiWbe used for the identl 


15 


20 


30 


Various suiwwc *u^w* — J % _ , . 

fication of a particular vector «^ to 
vector population. These methods include a) the inter 
I^ion of seouence-specific nucleic ^J^Z Z 
tne individual families which vere combined: (b) the 
nyb ridisation of nucleic acid probes with genes from the 
,ene families, (O the expression of on. or>oth genes 
Irom the gene families as an RMA molecule, and (d) the 
STestion of one or both genes as an identifiable protein 
Seoul., optionally, such an identifiable protein 
^ ecule may contain a binding site for another molecule 
an epitope recognised by » "f^"^ 
.olecule binding site for an epitope. ! ' 
identification method, both genes express an 
polypeptide and said RNAs and/or polypeptides physically 
with a host to create an identifiable character- 
istic Both genes may express polypeptides that physic- 
interact to form a neo-epitope receded by an 
™ molecule or polypeptides that physical* .interact 
to form a binding site for another molecule. <**°™»" 
!Lse polypeptides are derived from antibody genes such 
^aTthe interaction of both polypeptides forms an antigen 


35 


binding site. vectors produced 

in another preferred aspect, the vec t v 
according to the present invention contain a single 
" meter that expresses the genes from the gene families 
Alternatively, the genes from the gene families are each 
expressed fro* their own promoter. „ on *.* nn 
in a still further embodiment, the present invention 
contemplates the creation of combinations f two or more 
Nucleotide seguenc families (or populations, by iD^i^ 
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20 


10 

recombination. Such ?n v^ro recombination could be 
carried out using specific recombination target sequences 
and specific recombinases (like flp recombinase) , or by 
using homologous sequences shared by both nucleotide 
sequence populations to facilitate homologous 
'recombination. ., 

One method to accomplish a form of homologous 
recombination vitro is by using U vitro nucleic acid 
amplification methods such as the polymerase chain reac- 
tion (PCB ) . If both of two populations of DNA sequences 
share a region of homology, then it is possible during the 
PCR for base-pairing to occur between single stranded 
nucleic acid molecules from both populations of nucleotide 
sequences. If such base pairing creates a -primer- 
template complex" that can be used by a polymerase to 
begin synthesis of complementary strands, then a fusion 
product is created which will contain sequences from both 
nucleotide sequence populations (See Figure 21 here) . Jf_ 
~^ ne shared region of homology is p r esent on most or all o f. 
■Z^T^r, nnrieotide se quence populations^ t hpn most nr all 
of the nucleotd 
'recombination « 


Lde sequences 


can 


particij 


w ««*..»w*w... , - Inatorial population of fusion 

"=n^cleotide sequences can be produced, and subsequently 
inserted into a single expression vector for expression of 
25 the nucleotide sequence from both sequence families. Such 
a combinatorial population of expressed sequences can then 
be~lcr^ed~fo~r new phenotypes that would not be present 
if the sequences from only one population of nucleotide 
sequences were expressed, and would be present only with 
30 expression of particular combinations comprising a nucleo- 
tide sequence from each population. For example, such 
phenotypes could comprise the creation of heterodimeric 
proteins where one subunit of the dimer is encoded by one 
nucleotide sequence family and the other subunit of the 
35 dimer is encoded by the other nucleotide sequence family. 

Thus, the present invention is directed to methods of 
creating diversity, namely populations of diverse r plicas 
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of nucleotide sequences which may be co-bin*, to give a 
^versity of phenotypes, from which a ^/T^E 
» ay he selected. Such diversity may be generated starting 
w £h a single DNA molecule which is treated to create 
diversity, such as by mutagenesis or by starting with a 
family of nucleotide sequences (or genes) or a 


Combinatorial library. 

For example, one may start with a plasmld containing 
antibody sequences coding for both a light chain and a 
10 £vy Lin which has been isolated from a *nown 
Lnoclonal-antibody producing cell line. The nucleotide 
sequences coding for the light chain and the heavy ch«n 
Z?»* individually amplified (using a method such as FCK> 
- under conditions that mutated sequences 

15 create a population of mutated sequences. The individual 
" populations of mutated sequences may be used to ma.ce com- 
binatorial libraries which are then used to create novel 
pnenotypes. Alternatively, these individual populations 
£ mut"ed sequences may be combined using 
,o as fusion polynucleotide amplification (for example) 
20 fusion PC* (as described herein, and used to generate 
novel phenotypes. These novel phenotypes may include 
antibodies having enhance d antigen binding 

25 ^tZllZ:. another aspect of the present invention 
one or more genetically distinct phage may be Really 
replicated, conditions which are somewhat mutagenic, to 
generate a population (s, of diverse phage. 
Phenotypes distinct from the originals may be generated by 

30 cteavag! such as by a restriction endonuclease, ollowed 
by mixing of phage populations, and ligation, 
selection for expression of desired phenotypes. In this 
„ay phage having diverse phenotypes distinct from the 
parental phage may be generated combinatorial!^. 

« in another embodiment, the methods are utilized to 

produce novel human antibody-expressing DHA sequences, 
rlrst, an immunoglobulin heavy chain variable region V, 
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gene library containing a substantial portion of the V H 
gene repertoire of a vertebrate is synthesized. In pre- 
ferred embodiments, the V M -coding gene library contains at 
least about 10 s and more preferably at least about 10 and 
a^epreferabj^a^least^^ 

nucleic acid strands referred to herein as V„-coding DNA 
homologs. 

The gene library can be synthesized by various 
methods, depending on the starting material. Where the 
10 starting material is a plurality of V H -coding genes, the 
repertoire is subjected to two distinct primer extension 
reactions. The first primer extension reaction uses a 
first polynucleotide synthesis primer capable of initiat- 
ing the first reaction by hybridizing to a nucleotide 
15 sequence conserved (shared by a plurality of genes) within 
the repertoire. The first primer extension reaction 
produces a plurality of different V H -coding homolog 
complements (nucleic acid strands complementary to the 
genes in the repertoire). The second primer extension 
20 reaction produces, using the complements as templates, a 
plurality of different V H -coding DNA homologs. The second 
primer extension reaction uses a second polynucleotide 
synthesis primer that is capable to initiating the second 
reaction by hybridizing to a nucleotide sequence conserved 
25 among a plurality of V H -coding gene complements. 

Where the starting material is a plurality of 
complements of different V H -coding genes provided by a 
method other than the first primer extension reaction, the 
repertoire is subjected to the above-discussed second 
30 primer extension reaction. That is, where the starting 
material is a plurality of different V H -coding gene 
complements produced by a method such as denaturation of 
double strand genomic DNA, chemical synthesis and the 
like, the complements are subjected to a primer extension 
35 reaction using a polynucleotide synthesis primer that 
hybridizes to a plurality of the different V H -coding gen 
complements provided. Of course, if both a repertoire of 
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V-coding genes and their cedents are present in the 
starting material, both approaches can be used xn 

combination. _ , . „ ^ OT . a 

A V„-coding DNA hoBolog, 1^, a 9«ne coding for a 


^"^^tne-n^o produce the isolated 
U rn, be accoBplished by operative!* linking f or 
expression a plurality of the different V -coding DHA 
nolologs of the library to an expression vector. The V K - 

l0 e^resllon vectors so produced are introduced into a popu- 

1 talion of compatible host cells, i.e.. cells «pable^o 
expressing a gene operatively linked for expression to the 
^tor. The transformers are cultured under =°>>««°- 
for expressing the receptor ceded for by the V.-codxng DNA 

l5 no»olog. The transients are cloned and the clones are 
screened for expression of a receptor that binds the pre- 
ssed ligand. Any of the suitable methods well known 
i,, the art for detecting the binding of a Ixgand to a 
receptor can be used. A transfor»ant expressing the 

20 desired activity is then segregated from the population to 
oroduce the isolated gene. . 

A receptor having a preselected activity produced by 
a method of the present invention, preferably a V, or F v as 
described herein, is also contemplated. 

The present invention also encOBpasses P«*"** 
produced by the methods of the invention, such as the 
biological agents produced thereby, also the expression 
"relets of these methods such as polypeptides and nucleic 
acids, vectors produced and kits comprising any of the 

30 products of the claimed methods. 

mirf r-r-r'rt?"" °f the Prayings 

K I„ the drawings forming a portion of this disclosure, 
p <m ,re 1 illustrates a scheBatic diagram of the 
tamunoglobulin molecule shewing th principal 
35 Htur s. The circled area on the h avy chain repr sents 

Le variable region (V„, , a polyp ptide containing a 
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biologically active (ligand binding) portion of that 
region, and a gene coding for that polypeptide, are 
produced by the methods of the present invention. 
Sequences L03, L35, L47 and L48 could not be classified 
into an y predefined subgroups 


X f ^ — — . „ --^ i — — ~ ■ — -' — ~ 

Figure 2A is a diagrammatic sketch of an H chain of 
human IgG (IgGl subclass). Kumbering is from the N- 
terminus on the left to the C-terminus on the right. Kote 
the presence of four domains, each containing an intra- 
10 chain disulfide bond (S-S) spanning approximately 60 amino 
acid residues. The symbol CHO stands for carbohydrate. 
The V region of the heavy (H) chain (V H ) resembles V L in 
having three hypervariable CDR (not shown) . 

Figure 2B is a diagrammatic sketch of a human K chain 
15 (Panel 1). Numbering is from the N-terminus on the left 
to the C-terminus on the right. Note the intrachain 
disulfide bond (S-S) spanning about the same number of 
amino acid residues in the V L and q domains. Panel 2 
shows the locations of the complementarily-determining 
20 regions (CDR) in the V L domain! Segments outside the CDR 
are the framework segments (FR) . 

Figure 3 depicts the amino acid sequence of the V H 
regions of 19 mouse monoclonal antibodies with specificity 
for phosphorylcholine. The designation HP indicates that 
25 the protein is the product of a hybridoma. The remainder 
are myeloma proteins. (From Gearhart et al., Nature , 

291:29, 1981.) 

Figure 4 illustrates the results obtained from PCR 
amplification of mRNA obtained from the spleen of a mouse 

30 immunized with FITC. Lanes R17-R24 correspond to ampli- 
fication reactions with the unique 5- primers (2-9, Table 
1) and the 3* primer (12, Table 1), R16 represents the PCR 
reaction with the 5' primer containing inosine (10, Table 
1) and 3- primer (12, Table 1). Z and R9 are the ampli- 

35 fication controls; control Z involves the amplification of 
V for a plasmid (PLR2) and R9 represents the amplification 
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from the constant regions of sple n mKNA using primers 11 

and 13 (Table 1) . ' 

viaure 5 depicts nucleotide sequences of clones from 
the cDNA library of the PGR amplified V„ regions in Lambda 
y ZAP vector. The N-terminal 110 bases are listed here and 

the underlined nucleotides represen^O^^ 
— determining^regionJT - — . 

«m 6B depict the sequence of the 

synthetic DNA insert inserted into Lambda ZAP vector to 
produce Lambda Zap II V H (6A) and Lambda Zap V L <6B) 
expression vectors. The various features required for 
this vector to express the V H and V L -coding DNA homologs 
include the Shine-Dalgarno ribosome binding site, a leader 
sequence to direct the expressed protein to the periplasm 
as described by Mouva et al., J. ploy, Chem .. , , 255:27, 
19 80, and various restriction enzyme sites used to opera- 
tive^ link the V K and V t homologs to the expression 
vector. The V„ expression-vector sequence also contaxns a 
short- nucleic acid sequence that codes for amino acids 
typically found in variable regions heavy chain <V H 
Backbone) . This V H Backbone is just upstream and in the 
proper reading as the V H DNA homologs that are operatxvely 
linked into the Xho I and Spe I. The V L DNA homologs are 
operatives linked into the V L sequence <6B) at the Nco I 
and Spe I restriction enzyme sites and thus the V H Backbone 
region is deleted when the V L DNA homologs are operatively 
linked into the V t vector. 

rWe 7 depicts the major features of the bacterial 
expression vector Lambda Zap II V H (^-expression vector) 
are shown. The synthetic DNA sequence from Figure .6 is 
shown at the top along with the T 3 polymerase promoter from 
Lambda Zap II vector. The orientation of the insert in 
Lambda Zap II vector is shown. The V H DNA homologs are 
inserted into the Xho I and Spe I r striction enzyme 
35 sites. The V H DNA are inserted into th Xho I and Spe I 
sit and the read through transcript! n produces the 
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decapeptide epitope (tag) that is located just 3 • of the 

cloning sites. 

Figure 8 depicts the major features of the bacterial 
expression vector Lambda Zap II V t <V t expression vector) 
are shown. The synthetic sequence shovm in Figure 6B is 
shown at the top along with the T 3 polymerase promoter from 
Lambda Zap II vector. The orientation of the insert in 
Lambda Zap_vector^is_shown^_^ V.^^pmologs^re 


15 

TG 


inserteTinto the phagemid that is produced by the in v^vo 
excision protocol described by Short etal., Nucleic Acids 
ggs^., 16:7583-7600, 1988. The V t DNA homologs are inserted 
into the Nco I and Spe I cloning sites of the Phagemid. 

Pi oure 9. depicts a modified bacterial expression 
vector Lambda Zap II V L II. This vector is constructed by 
inserting this synthetic DNA sequence, 
lAATTCTAAACTAGTCGCCAAGGAGACAGTCATAATGAA 

TCGAACTTAAGATTTGATCAGCGGTTCCTCTGTCAGTATTACTT 
ATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCTG 

TATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGAC 
20 CCCAACCAGCCATGGCCGAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 

GGGTTGGTCGGTACCGGCTCGAGCAGTCAAGATCTCAATTCGCCGGCAGCT 

into Lambda Zap II vector that has been digested with the 
restriction enzymes Sac I and Xho I. This sequence 
contains the Shine-Dalgarno sequence (ribosome binding 
25 site) , the leader sequence to direct the expressed protein 
to the periplasm and the appropriate nucleic acid sequence 
to allow the V L DNA homologs to be operatively linked into 
the Sad and Xbal restriction enzyme sites provided by 
this vector. 

30 TH«m- e 10 depicts the sequence of the synthetic DNA 

segment inserted into Lambda Zap II vector to produce the 
lambda v L II-expression vector. The various features and 
restriction endonuclease recognition sites are shown. 

figure 11 depicts the vectors for expressing V H and V L 

35 separately and in combination. The various essential 
compon nts f these vectors are shown. The light chain 
v ct r or V L expression vector can be combined with the V H 
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expression vector to produce a combinatorial vector con- 
taining both V H and V L operatives linked for expression 

to the same promoter. 

Figure 12 depicts the labelled proteins immuno- 
precipitated fro^E^coii^containing-a-V^and-a-V^DNA^ 
1^og~are~shown. In lane 1, the background proteins 
inununoprecipitated from E^ccOi that do not contain a V or 
V L DNA homolog are shown. Lane 2 contains the V H protein 
iUunoprecipitated from E^sU containing only a V, DNA 
homolog. in lanes 3 and 4, the emigration of a V„ 
protein a V L protein immunoprecipitated from p. coU 
containing both a V H and a V L DNA homolog is shown. In 
lane 5 the presence of V H protein and V t protein expressed 
from the V H and V L DNA homologs is demonstrated by the two 
distinguishable protein species. Lane 5 contains the 
background proteins immunoprecipitated by anti-E^sU 

antibodies present in mouse ascites fluid. 

r^ re 13 depicts the transition state analogue 
(formula 1) which induces antibodies for hydrblyzing 
carboxamide substrate (formula 2). The compound of 
formula 1 containing a glutaryl spacer and a N- 
hydroxysuccinimide-linker appendage is the form used to 
duple the hapten (formula 1, to protein carriers ^ and 
BSA, while the compound of formula 3 is the inhibitor. 
The phosphonamidate functionality is a mimic of the 
stereoelectronic features of the transition state for 
hydrolysis of the amide bond. 

UgureJA illustrates the PCR amplification of Fd and 
Kappa regions from the spleen mRNA of a mouse immunized 
with NPN. Amplification was performed as described in 
Example 17 using ENA cDNA hybrids obtained by the reverse 
transcription of the mRNA with primer specific for ampli- 
fication of light chain sequences (Table 2, or heavy chain 
sequences (Table 1) . Lanes F1-F8 represent the product of 
heavy chain amplification reactions with ne of each of 
L^ight 5. primers (prim rs 2-9, Tabl, 1) and the uniqu 
3 . primer (primer 15, Table 2). Light chain (k> amplifi- 
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cations with the 5- primers (primers 3-6, and 12, respect- 
ively, Table 2) are shown in lanes F9-F13. A band of 700 
bps is seen in all lanes indicating the successful 
amplification of Fd and k regions. 

viaur-e 15 depicts the screening of phage libraries 
for antigen binding is depicted according to Example 17C. 
Duplicatep^xeJ^ts^ 

-f^ers-^FFand-Tigb^^ G,H) expression 

libraries were screened against ^I-labelled BSA conjugated 
with NPN at a density of approximately 30,000 plaques per 
plate. Filters C and D illustrate the duplicate secondary 
screening of a cored positive from a primary filter A 
(arrows) as discussed in the text. 

Screening employed standard plaque lift methods. XL1 
Blue cells infected with phage were incubated on 150mm 
plates for 4 hours at 37 -C, protein expression induced by 
overlay with nitrocellulose filters soaked in lOmM isopro- 
pyl thiogalactoside (IPTG) and the plates incubated at 25- 
for 8 hours. Duplicate filters were obtained during a 
20 second incubation employing the same conditions. Filters 
were then blocked in a solution of 1% BSA in PBS for 1 
hour before incubation with rocking at 25' for 1 hour with 
a solution of 125 I-labelled BSA conjugated to NPN (2 x 10 
cpm ml* 1 ; BSA concentration at 01 M; approximately 15 NPN 
per BSA molecule) in 1% BSA/PBS. Background was reduced 
by pre-centrif ugation of stock radiolabeled BSA solution 
at 100,000 g for 15 minutes and pre-incubation of solu- 
tions with plaque lifts from plates containing bacteria 
infected with a phage having no insert. After labeling, 
filters were washed repeatedly with PBS/0.05% Tween 20 
before development of autoradiographs overnight. 

Venire 16 depicts the specificity of antigen binding 
as shown by competitive inhibition is illustrated accord- 
ing to Example 17C. Filter lifts from positive plaques 
were exposed to 12S I-BSA-NPN in the presence of increasing 
concentrations of the inhibitor NPN. 
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in this study a number of phages correlated with NPN 
binding as in Figure 15 were spotted (about 100 particles 
per spot) directly onto a bacterial lawns. The plate was 
then overlaid with an IPTG-soaked filter and incubated for 
19 hours at 25'. The filter were then blocked in 1% BSA 
in PBS prior to incubation "in ^I-BSA-NPN as described 
previously in Figure 15 except with the inclusion of vary- 
ing amounts of NPN in the labeling solution. Other 
conditions and procedures were as in Figure 15. The 
results for a phage of moderate affinity are shown in 
duplicate in the figure. Similar results were obtained 
for four other phages with some differences in the 
effective inhibitor concentration ranges. 

F <m ,r e 17 depicts the characterization of an antigen 
binding protein is illustrated according to Example 17D. 
The concentrated partially purified bacterial supemate of 
an NPN-binding clone was separated by gel filtration and 
aliquots from each fraction applied to micrbtitre plates 
coated with BSA-NPN . Addition of either anti-decapeptide 
( , or anti-kappa chain antibodies conjugated with alka- 
line phosphatase was followed by color development. The 
arrow indicates the position of elution of a known Fab 
fragment. The results show that antigen binding is a 
property of 50 kD protein containing both heavy and light 
25 chains. 

Single plagues of two-NPN-positive clones (Figure 15) 
were picked and the plasmid containing the heavy and light 
chain inserts excised. 500 ml cultures in L-broth were 
inoculated with 3 ml of a saturated culture containing the 
excised plasmids and incubated for 4 hours at 37 *C. 
Proteins synthesis was induced by the addition of IPTG to 
a final concentration of imM and the cultures incubated 
for 10 hours at 25'C. 200 ml of cells supemate were 
concentrated to 2 ml and applied to a TSK-G4000 column; 
50 til. aliquots from the eluted fracti ns were assayed by 
ELISA. 
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For ELISA analysis, microtitre plates were coated 
with BSA-NPN at 1 ug/ml, 50 nl samples mixed with 50 m 
PBS-Tween 20 (0.05%) -BSA (0.1%) added and the plates 
incubated for 2 hours at 25'. After washing with PBS- 
-^ien-2-(PBSA— 50-Ml-of-appropriate^conce ntrations of a 


Tween ^u-dsa, ~- -x-x-- * 

rabbit anti-decapeptide antibody (20) and a goat anti- 
mouse kappa light chain (Southern Biotech) antibody 
conjugated with alkaline phosphatase were added and 
incubated for 2 hours at 25'. After further washing, 50 
10 M l of p-nitrophenyl phosphate (tag/ml in 0.1M Tris pH 9.5 
containing 50 mM MgCl 2 ) were added and the plates incubated 
for 15-30 minutes before reading the OD at 405nm. 

figure ISA depicts the major features of the 
bacterial expression vector HCFLP containing a V H DNA 
15 homolog and a flp recombination site. 

Figure 18B depicts the major features of the 
bacterial expression vector LCFLP containing a V L DNA 
homolog and a flp recombination site properly oriented for 
recombination with the HCFLP vector. 
20 Figure 19 depicts a diagrammatic sketch of bacterial 

coinfection with HCFLP and LCFLP vectors for the produc- 
tion of recombinant expression vectors containing V L and V„ 
DNA homologs. 

Figure 20 depicts an outline showing arm selection 
25 for heavy and light chain recombinant vector products 
using flp recombinase in conjunction with selection based 
on the inclusion of genes having amber mutations . 

Figure 21 shows ah outline of a method of phenotype 
creation using the fusion PCR process described herein. 
30 Figure 22 illustrates human fusion PCR inside 

primers. The heavy chain inside primer sequence is 

written 3« to 5« and the light chain V L inside primer, 
sequence is written 5- to 3«, Note that it is not the 
primer strands that cross-prime to create the fusion 
35 mol cule, but the complementary PCR product strands. 
Boxed nucleotides r present r gions where the c.l« primer 
hybridizes to the 3 • end of C„l on human IgG heavy chain 
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mRNA or where the V L primer hybridizes to the 5- end of V L 
framework-1 on human kappa light chain cDNA. Underlined 
sequences indicate the two stop condons. The italicized 
amino acid and nucleotides indicate changes in sequence 
5 from the original pelB leader sequence. The mouse fusion- 
PGR internal primers overlap in a similar manner. 

Figure 23 illustrates an ethidium bromide stained 

agarose gel. After-PCR- amplification from human cloned __ 

DNA of heavy chain alone (HC) , light chain alone (LC) , and 

10 the heavy/light dicistronic DNA molecule (H/L) , DNA sam- 
ples were electrophoresed. The expected sizes of the HC, 
LC, and H/L products visualized on the gel were approxi- 
mately 730, 690, and 1,390 base pairs, respectively. 

Figures 24A and 24B illustrate the major features of 

15 the bacterial expression vector Lambda ZAP II Modified V H 
(Modified ImmunoZAP H) (V„-expression vector) (IZ H) . The 
amino acids encoded by the synthetic DNA sequence from 
Figure 24A is shown along with the T 5 polymerase promoter 
from Lambda ZAP II. The orientation of the insert in 

20 Lambda ZAP II is as presented. The insert was modified by 
the elimination of the Sac I site between the T 3 polymerase 
and Not I site and by the change of amino acids at the 5* 
end of the heavy chain from QVKL to QVQL (alysine residue 
was changed to a glutamine residue) . The V„ and V L DNA 

25 homologs were inserted into the Xho I and Xba I cloning 
sites of the phagemid as described in Figure 26 and shown 
in Figure 24B. The modifications were made to create a 
fusion-PCR library from hybridoma RNA, to overcome 
decreased efficiency of secretion of positively charged 

30 amino acids in the amino terminus of the protein. Inouye 
etal., ^ * A ~ US *» 85:7685-7689 (1988), 

and to. make the V L Sac I cloning site a unique restriction 
site. 

Figures 25A and 25B illustrat the sequences of the 
35 synthetic DNAs inserted into Lambda ZAP to produce Lambda 
Zap II V„ (ImmunoZAP H) (25A) and Lambda Zap V L (ImmunoZAP 
L) (25B) expr ssion vectors. The various features 
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required for these vectors to express the V H and V L -coding 
DNA -homologs include the Shine-Dalgamo ribosome binding 
site, a leader sequence to direct the expressed protein to 
the periplasm as described by Mouva et al., jr. pjoli 
5 chem. . 255:27, 1980, and various restriction^enzyine- sites 
used to operatively link the V H and V L homologs to the 
expression vector. The V H expression-vector sequence also 
contains a short nucleic acid sequence that codes for 
amino acids typically found in variable regions of the 

10 heavy chain (V H Backbone) . This V H Backbone is just 
upstream and in the proper reading frame as the V H DNA 
homologs that are operatively linked into the Xho I and 
Spe I restriction sites. The V L DNA homologs are opera- 
tively linked into the V L sequence (25B) at the Sac I and 

15 Xba I restriction enzyme sites. 

Figure 26 illustrates the major features of the 
bacterial expression vector Lambda Zap II V K (ImmunoZAP H) 
(V H - expression vector) . The amino acids encoded by the 
synthetic DNA sequence from Figure 25A is shown at the top 

20 along with the T 3 polymerase promoter from Lambda Zap II. 
The orientation of the insert in Lambda Zap II is as pre- 
sented. The V H DNA homologs were inserted into the 
phagemid that is produced by the in vivo excision protocol 
described by Short et al,, Nucleic A cids Res,, 16:7583- 

25 7600, 1988, The V H DNA homologs were inserted into the 
Xho I and Spe I restriction enzyme sites. The read 
through transcription produces the decapeptide epitope 
(tag) that is located just 3 1 of the cloning sites. 

Figure 27 illustrates the major features of the 

30 bacterial expression vector Lambda Zap II V L (ImmunoZAP L) 
(V L expression vector) , The amino acids encoded by the 
synthetic DNA sequence shown in Figure 25B is shown at the 
top along with the T 3 polymerase promoter from Lambda Zap 
II. The orientation of the insert in Lambda Zap II is as 

35 presented. The V L DNA homologs are insert d into the Sac 
I and Xba I cloning sites of the phagemid as described in 
Figure 26. 
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• D _ autoradiogram showing 

. previously soaXed *n XO m ^ ^ssim^mm-^ 

^*™°°"°-<*™^ t0 -^;£?-7ere reacted with «X- 
OT emi 9 bt inc^ion ^ ^ £ilte rs were 

tetanus toxoid probe. £ represents 
exposed to X -r.v til«._^col ^ ^ 

10 tbe parental clones that were , el- , 
.orial library. »U* etal.. ^f^T^ 
m , 87:8095-8099 (1990). ^ ^ a-plifyina, the 

S ents clones that were grated by 
con binatoriel la^da clone DHA witb ^ e ^ ' recloning 
„ i i =r,rj v inside primers, followed oy 

15 primers, C H 1- and V L inside P ^ ■ ?G1 is a nega _ 

in the modified ImmunoZAP H vector ^ 
tive control which ^ ^ ^ 

with tetanus toxoid. Clones w edified 
F abs that react with tetanus toxoid. IZ H is « 
20 Te^vy chain ImmunoZAP B vector without an xnsert. 


prtnllr^ n oe ^ r <Ptio 
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As used herein, _ tated to the contrary: 

following menings unless e ^ eSSl f „ ^ consis t- 
Nucleotide: a monomeric unit of DNA or k*h 

stta — asV rz--.r. 

pentose, and th»t . ph ospbate 

SSI- tlTe ror-r/osition o £ tbe pentose it is 

* nine ia) with thymine (T) , or of cytosine (C) vaxn 
^aninl (0) m a double stranded .HA molecule. Xn ' 
uracil (U) is substituted for thymine. 
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Nucleic Acid : a polymer of nucleotides, either 
single or double stranded. 

Gene ; a nucleic acid whose nucleotide sequence codes 
for an RNA or polypeptide. A gene can be either RNA or 
5 DNA. 

Complementary Bases : nucleotides that normally pair 
up when DNA or RNA adopts a double stranded configuration. 

Complementary Nucleotide Sequence ; a sequence of 
nucleotides in a single-stranded molecule of DNA or RNA 
10 that is sufficiently complementary to that on another 
single strand to specifically hybridize to it with 
consequent hydrogen bonding. 

Conserved; a nucleotide sequence is conserved with 
respect to a preselected (reference) sequence if it non- 
15 randomly hybridizes to an exact complement of the 
preselected sequence. 

Hybridization; the pairing of substantially 

complementary nucleotide sequences (strands of nucleic 
acid) to form a duplex or heteroduplex by the establish- 
ed ment of hydrogen bonds between complementary base pairs. 
It is a specific , i.e. non-random, interaction between two 
complementary polynucleotides that can be competitively 
inhibited. 

Nucl eotide Aha 1 oa ; a purine or pyrimidine nucleotide 
25 that differs structurally from A, T, G, C, or U, but is 
sufficiently similar to substitute for the normal nucleo- 
tide in a nucleic acid molecule. 

DNA Homolocr; is a nucleic acid having a preselected 
conserved nucleotide sequence and a sequence coding for a 
30 receptor capable of binding a preselected ligand. 

Receptor; A receptor is a molecule, such as a 
protein, glycoprotein and the like, that can specifically 
(non-randomly) bind to another molecule. 

Antibody; The term antibody in its various grammati- 
35 cal forms is used herein to refer to immunoglobulin 
molecules and immunologically active portions f immuno- 
globulin molecules, i.e., molecules that contain an 


SUBSTITUTE SHEET 


WO 91/16427 


PCT/US91/02910 


25 


10 


15 


20 


25 


30 


35 


antibody combining site or paratope. Exemplary antibody 
molecules are intact immunoglobulin molecules, substan- 
tially intact immunoglobulin molecules and portions of an 
immunoglobulin molecule, including those portions known in 
the art as Fab, Fab', F(ab') 2 and F(v) . 

an^hndv comtt^na Site ; An antibody combining site 
is that structural portion of an antibody molecule com- 
prir ~* a heav y and light chain variable and hypervari- 
able regions that specifically binds (immunpreacts with) 
an antigen. The term immunoreact in its various forms 
means specific binding between an antigenic determinant- 
containing molecule and a molecule containing an antibody 
combining site such as a whole antibody molecule or a 

portion thereof. 

pporianal antibody ; The phrase monoclonal antibody 
in its various grammatical forms refers to a population of 
antibody molecules that contains only one species of anti- 
body combining site capable of immunoreacting with a 
particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with 
which it immunoreacts. A monoclonal antibody may there- 
fore contain an antibody molecule having a plurality of 
antibody combining sites, each immunospecific for a 
different antigen, e.g., a bispecific monoclonal antibody. 

upstream ; In the direction opposite to the direction 
of DNA transcription, and therefore going from 5 ' to 3' on 
the non-coding strand, or 3 • to 5- on the mRNA. 

nownstream ; Further along a DNA sequence in the 
direction of sequence transcription or read out, that is 
traveling in a 3'- to 5 '-direction along the non-coding 
strand of the DNA or 5'- to 3 '-direction along the RNA 
transcript. 

cistron ; Sequence of nucleotides in a DNA molecule 
coding for an amino acid residue sequence. 

stop codon : Any of thr e codons that do not cod for 
ah amino acid, but inst ad cause terminati n of protein 
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synthesis. They are UAG, UAA and UGA. Also referred to 
as a nonsense or termination codon. 

T^r^-r Polypeptide ; A short length of amino acid 
sequence at the amino end of a protein, which carries or 
5 directs the protein through the inner membrane and so 
ensures its eventual secretion into the per iplasmic space 
and perhaps beyond. The leader sequen ce p eptide is 
" — commonly removed before the proteitrbecomes^active. 

p^rHrvr Frame ; Particular sequence of contiguous 
10 nucleotide triplets (codons) employed in translation. The 
reading frame depends on the location of the translation 

initiation codon. 

7 P «HrtA Primer : An inside primer is a polynucleotide 
that has a priming region located at the 3 ' terminus of 
15 the primer which typically consists of 15 to 30 nucleotide 
bases. The 3' terminal-priming portion is capable of 
acting as a primer to catalyze nucleic acid synthesis. 
The 5 '-terminal priming portion comprises a non-priming 
portion. 

20 nnttHda Primer ; An outside primer comprises a 3«- 

terminal priming portion and a portion that may define an 
endonuclease restriction site which is typically located 
in a 5 '-terminal non-priming portion of the outside 
primer. 

25 pmHon Pol vrmcleoti ^ fmpH*l cation: refers to in 

vitro techniques of generating a multiple complementary 
copies of a nucleic acid template which comprises nucleo- 
tide sequences which have been randomly combined to give 
a combined nucleic sequence. These techniques typically 

30 employ complementary" primers which hybridize to the 
template and are extended in a primer extension reaction. 
The polyumerase chain reaction (PCR) techniques described 
herein comprise a preferred method of nucleotide sequence 
amplifications. Generation and amplification of a 

35 combined nucleotide sequence using fusion PCR is further 
described herein. 
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Ar> the tern "vector" refers to 
^ used herein the tern 

a nucleic acid ^^ C ^' a £ t £ r 'oleic acid to 
dif ferent genetic one type of P~- 

which it has been operatxveiy ^ Bol£Cule 

5 ferred vector is an epxsone, i.e. ' ^ suitable 

capable of extra-cnro^r^^ ^ ^ 

vectors include pl asmxd and cosmid . ^ 

and/oVexpres^ 
are those capable of autono* _ ^ Vectors 

si on of nucleic acids to which thenar 

capable of directing the tt nerei n as 

arc operatives linked are referred 

"expression vectors". 

B. MetHods „ or , et ic engineers . typically 

, Until this ^f-'^H^e g 9 ene or family (=r 

de alt with the expression f ^ expresslon of a 
population, of genes, one a to as a 

family of genes in a ™ctor 9 ^ normally 
,, g ene library." Bach member <***^ However . the 
„ contain a different gene or J** seen. typlcal iy 
vector portion of «^ a veotorg-e ^ ^ _ 

identical from member «V _ ^ ^ ^ o£ten 

be . and typically are, amp topUflc ation 

:5 identify and •»» " » bMteri01 

occurs so that each l^rary - (for 

or Phage particles. _ olone containing a single 

Th e search for a be accomplished in 

g ene or DHA sequence of a b mase 

« aiKtt r^="caUr^"S with a nucleic 
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RNA species that can be identified, for example by nucleic 
acid hybridization. The RNA species may, furthermore, be 
translated into a protein, typically by the host cell, 
that may be identified, for example, by reactivity with an 
~5~^antibody prober— Alternatively^ the_protein may be recog- 
nized because it binds a substrate, or catalyzes a 
reaction, or allows the host cell to survive under 
selective conditions, and so on. 

Described herein are libraries in which two or more 
10 families (or populations) of genes are expressed in a 
vector or a host cell in such a way that the gene combi- 
nations are randomly represented and subsequently detected 
on the basis of some property or characteristic in the 
event that a particular combination of one member from a 
15 first gene family and one gene from a one or more other 
gene families are combined in a vector host cell. For 
example, in the general case if there are "i" members of 
the gene family "A" and w j n members of the gene family 
"B", there will be (i) x (j) combinations of selected gene 
20 members A and B in the randomly created vector-gene 
population. If there are three gene families, A, B, and 
C, and a vector is made containing one member from each of 
the three gene families, the total number of combinations 
of genes will be the product of the number of A genes 
25 times the number of B genes times the number of C genes. 
Thus, methods are provided wherein at least two genes may 
randomly be combined, preferably on- the same vector 
molecule, having been identified within a population of 
vectors containing other combinations of different genes 
30 from the same two or more gene families. This approach 
may be broadly accomplished by means other than recombi- 
nation, for example, the use of a vector having at least 
two independent insertion sites for two foreign genes or 
inserting in a vector a nucleotide sequence comprising 
35 nucleotide sequences from each gene family. The recombi- 
nation of at least two s parate library populations to 
make a combinat rial population, f r example, using a 
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common restriction site or site-directed recombination 
systems, is also contemplated. 

Thus, in addition to the above-described methods, the 
invention also provides for vectors having f 
5 and sequences useful for the preparation of combmatorx.l 
vectors encoding random DNA sequences from two or more 
g ene families. Sucn^ectorsjnclude^lasmds^andph^ 
--fortaininT^omW^estric^^ 

the in 313* recombination of said DNA sequences from said 

10 9ene ^firsite-specific recombination of ssraisise 
has been described in Cox, chapter 13 in "Genetic Recombi- 
nation," eds. R. Kucherlapati and G. smith (American 
Zlty for Hicrobiology . Within a ^ 

15 region identified as the recombination site and designated 
« (flP recombination target), there are several promi- 
nent structural features. The most important are a set of 
three bp repeats. The second and third repeats are separ- 
ated by one bp an* are in the same orientation. —J™* 

2 0 repeat is inverted with respect to the other two and is 
separated from the second repeat by an eight bp spacer. 
The first repeat also has a one bp mismatch relative to 
Z first two. Deletion analysis has 

.he third repeat is ™°">™ '""T'TtZj?- 


the tnira repeat - ^ 

vi^, although it »ay have a slight effect on the reac- 
^on"in vivo. Additional deletions indicate that most, 
but not all, of the first and second repeats (those 
spacer) are retired. While deletion of 


three bp from the distal ends of one or both of these 
■so repeats has no detectable effect on the reaction, further 
option leads to a gradual reduction in site function 
with complete less of site function occurring (^^. 
with deletions of eight bp or more from either end The 
ILimal site required for a full function in *to U 
35 before relatively small (approximately 28 bp includ ng 
Z spacer and the proximal 10 bp of each flanking 
. Accordingly, it will be seen that the full. 
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intermediate, or minimal FRT sequences can be utilized to 
accomplish flp-mediated site-specific recombination. 

The lambda phage attachment site is responsible for 
integration of lambda into the host chromosome. It also 
acts as a hot spot of recombination and lytic crosses 
between wild lambda chromosomes. As in lambda, in PI 
pha ge a site-specific cross over site, loxP acts as a hot 
spot of recombination. This site is recognized-bythe-Pl 
ere protein, a known site-specific protein. The site- 
specific recombination system is responsible for the rare 
integration of PI into the host chromosome. The cre-lox 
system of bacteriaphage PI is also useful for the site- 
specific recombination contemplated by the inventxon 
described and claimed herein. 

A transposbn can jump from one vector to another 
vector or from a vector to a bacterial chromosome. 
Different transposons having different inverted repeat 
sequences and carrying, for example, different drug- 
resistance genes, can be used to carry out the desired 
random combination of genes as described herein either in 
yiia or <n vitro . The transposon may, but need not, also 
contain a sequence encoding the transposase enzyme which 
catalyzes the "hop," Various suitable transposon systems 
have been described in the literature. (See, Mobile. 
Douglas E. Berg and Martha M. Howe, eds., American Society 
for Microbiology, Washington, D.C., 1989). One suitable 
transposon system is the gamma-delta transposon system 
which has been isolated from E. Coli. 

Thus, in addition to restriction digestion and 
ligation, use of flp type recombination systems, and 
homologous recombination, a transposon system can also be 
used to integrate a light (or heavy) antibody chain clone, 
into a heavy (or light) antibody chain clone. For 
example, this can be accomplished by flanking the light 
chain expression and cloning region with transposon 
terminal sequences. A library constructed in this light 
chain vector could be used to co-infect bacteria with 
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inserts between J*e r lnte other £or 

*» tU chain --can- ^ posl . 


in the presence of ^ clone ca n be a posi . 

placing a selec ^ansposon n ° pt \ nleete d culture 

'ioned ^ ee " , ^ecovere^ro^eeo-int'C ^ 
subsequently. P^J n selection * 

is pl-t- ' * light chain ""^j^ions of a 

ttls second Pla only one 1 da phage 

hl9h cell to P ha « e . ^ each cell. » ^ ^ 

^ 1CaUY " nl" « - contains 

Bb ould grow only i „ OE t e £flc eEsen - 

15 heavy and X, the -P ^ n ^ e wixl not 

«- tr ^e neavy chain in the proper- 

ti al genes of th lni „g the »anspoV A collection 

' rOW - OTlV i^ U -av, chain ^ f ^ iIiat orial heavy 
position vi*Wn ees a 1UJ rary of 

on of these clone v^odv clones. invention, 
20 and «** chain ant ibody ^ ^ p ^ nt » ^ ^ 

. uU « aerate « o ^ ^ 

fusion FCR I s ^ na ve one of " r egions of 

trB o.ents. each of>* ^ endsshar^ ^ 

. 5 w directed »isp«»xng ter »ini. * ■ .. 

eOT ple»entarxty, _ deJia tured and rea u<> to 

frB g»ents » e ^ e ' term ini on two strand ^ internally 

Ted The subsequent PC* cycle P molec ule that U 

30 prl ^ regions to form a hybri 

overlapping reg 21 , escri bed in detail in 

dicistronic *°* „ etll ods are f ^r^.soO.XSS. «• 

PCR T"os *.«"' 192 ' 4 ' £ \'te^s including 
0 .S. Patent W». ^ several r 0SA 

■technology: * ^ , 6t oc*to» 

M.pXifioatlon". 

...--mure s«eer 
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25 


(1989); and "PGR Protocols" A Guide to Methods and 
Applications", Innis et al., eds., Academic Press, San 
Diego, California (1990). 

Thus, in one aspect of the present invention, fusion 
PGR is used to produce a library of dicistronic DNA mole- 
cules- containing -upstream and downstream cistrons wherein 
first and second PGR amplification products are produced 
using respective first and second PGR primer pairs. The 
first PCR primer pair comprises a first polypeptide 
outside primer and a first polypeptide inside primer. 
Similarly, the second PCR primer pair comprises a second 
polypeptide outside primer and a second polypeptide inside 
primer. The first and second polypeptide inside primers 
contain complementary 5 • -terminal sequences that allow 
is their DNA complements to hybridize and form an internally- 
primed duplex having 3 • -overhanging termini. The 
internally-primed duplex is then subjected to primer 
extension reaction conditions to produce a double 
stranded, dicistronic DNA having substantially blunt or 
blunt ends. The dicistronic DNA is then PCR amplified 
using the outside primers as a PCR primer pair. 

The dicistronic DNA molecule comprises two amino acid 
residue-coding sequences on the same strand separated by 
at least one stop codon and at least one signal sequence 
necessary for translation of the downstream cistron, such 
as a translation initiation codon, ribosome binding site, 
and the like. Thus, the upstream and downstream cistrons 
of the dicistronic DNA molecule are operatively linked by 
a cistronic bridge. The cistronic bridge comprises the 
genetic elements necessary to terminate translation of the 
upstream cistron and initiate translation of the down- 
Btream cistron. For instance, the coding strand of the 
bridge codes for one or more stop codons, preferably two, 
in the same translational reading frame as the upstream 
cistr n. The cistronic bridge coding strand preferably 
also encodes a rib s me binding site for the dowstream 
cistron located downstream from the upstream cistron' s 
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st op codon(s). Typically, the coding strand of the 
chronic br dge will also encode a leader polypeptide 
:^t in the sa»e translational reading fra^ >~ the 
--^stre^istron^vmen-pre^^ 

,«™ the leader usually begins with an 
5 sequence encoding the le *° tive distance, 

initiation codon located within an op 
Tl is operatively linHed, to the ribosce binding site 
1 ihe following discussion illustrates «-.«-•-* 
fusi on PCR to 

" ^ZT LIZ'XT:^ T, illustrating an ^ *- 

mcn ber from each of two or m fami iies of 

illustrated method can be used witn o^n 
15 conserved genes which each for one unit of a dimeric 
15 conserved g directly from a natural source, 

receptor, whether obtained directly ir 

1 naive or in vivo immunized cells, or from cells or 


vitro . 
20 elements: 


ncs: 

1. Producing V„ and V L gene repertoires. 
2 Preparing sets of outside and inside polynucleo- 
tid e prLrs for cloning . polynucleotide segments 
containing immunoglobulin V H and V region genes. ^ 
, 5 3. Preparing a library containing a plurality of 

25 dif ferent dicistronic DHA molecules, each containing a V H 

aDdB ^ Mefr0 V he r^trt^ecule S in 
4. Expressing the dicistronic 

suitable host ceils. . 

5 screening the polypeptides expressed by the 
30 , , ^ fnr the preselected activity, 

dicistronic DNA molecules for the pre 

4.4 « » dicistronic DNA molecules for the 
TJ^T^X ^^ a dicistronic 
„ol cul identified by the screening process. . 

The present invention also provides a n vel 
method f r screening variants of a parental clone or 
« the parental clone or cl nes c ntain two 
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nucleotide sequences that, when expressed together, create 
a phenotype, then such nucleotide sequences can be altered 
to create populations of variants of such nucleotide 
sequences. If the two variant populations are coexpressed 
5 in a random fashion (that is with no correlation between 
the specific alterations Bade in the two different nucleo- 

tide seq uences ) , then a combinatorial collection of such 

nucleotide sequence variants has been created. Such com- 
binatorial collections may be screened for the presence 
10 of phenotypes that are unlike the parental clone or 
clones. Generally, the method combines the following 
elements: 

1. Replicating a clone containing a nucleotide 
sequence under conditions that allow mutations to occur. 
15 2. Replicating a second clone containing a second 

nucleotide sequence under conditions that allow mutations 

to occur. t 

3. Randomly combining and co-expressing the two 
mutated populations of nucleotide. sequences. 
20 4. screening clones containing combinations of 

mutated nucleotide sequences for phenotypes that were not 
present in either parent clone. 

Alternatively, the methods combine the following 

elements: 

25 i. Replicating at least portions of two nucleotide 

sequences contained within a single clone under conditions 
that allow mutations to occur in either nucleotide 

sequence. ^ 

2 Allowing recombination events, between the two 

30 nucleotide sequence populations to reassociate mutant 
nucleotide sequences to form new pairs of the two 
sequences that were not paired in the original mutated, 

replicated population. 

3 screening clones containing combinations of 
35 nucleotide sequences f r phenotypes that were ntpr sent 

in the parent cl ne r in the mutant replicas of the 
parent clone. 
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For example, assume a parent clone containing two 
nucleotide sequences A and B is replicated under mutating 
conditions such that variant clones are formed: 
Parent: A/B 
5 Variant 1: Al/B 

Variant 2: A/Bl 

variant 3: A2/B1 ^ 

Variant 4: A/B2 
Variant 5: A3/B 
10 However, within this mutated population, the combinations 
A1/B2, A2/B, A2/B2, A3/B1, and A3/B2, do not occur. If 
the mutant population (including some non-mutated parent 
clones) is allowed to recombine sequences A and B and 
their variants, then combinations such as A1/B2, A2/B etc. 
15 can be created. Such new combinations may express a , 
desired phenotype that was not present in the parental or 
the variant population. 

in one aspect, the present invention is related to 
methods for tapping the immunological repertoire by 
20 isolating from ^-coding and V L -coding gene repertoires 
genes coding for a heterodimeric antibody receptor capable 
of binding a preselected ligand. Generally, the method 
combines the following elements: 

1 . isolating nucleic acids containing a substantial 
25 portion of the immunological repertoire. 

2. Preparing polynucleotide primers for cloning 
polynucleotide segments containing immunoglobulin V H and V t 
region genes. 

3. Preparing a gerie library containing a plurality 
30 of different V„ and V L genes from the repertoire. 

4. Expressing the V H and V t polypeptides in a 
suitable host, including prokaryotic and eukaryotic hosts, 
on the same expression vector. 

5. sere ning the expressed polypeptides f r the 
35 preselected activity, and segregating a V H - and V r coding 

gene combination identifi d by the screening pr cess. 
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In one aspect, the expressed phenotype produced by 
the methods by the present invention comprises a multi- 
meric polypeptide product (i.e. a heterodimer, etc.) which 
assumes a conformation having a binding site specific for, 
as evidenced by its ability to be competitively inhibited, 
a preselected or predetermined ligand such as an antigen, 
enzymatic substrate and the like. In one embodiment, the 
multimeHc~poiypep1:i"de^ 


gen binding site which specif icaliy binds to a preselected 
10 antigen to form an immunoreaction product (complex) having 
a sufficiently strong binding between the antigen and the 
binding site for the immunoreaction product to be iso- 
lated. The antibody typically has an affinity or avidity 
is generally greater than 10 5 -M" 1 . 
15 In another embodiment, a multimeric polypeptide 

produced according to the present invention is capable of 
binding a substrate and catalyzes the formation of a 
product from the substrate. While the topology of the 
ligand binding site of a catalying multimeric polypeptide 
20 is probably more important for its preselected activity 
than its affinity (association constant or pKa) for the 
substrate, the useful catalytic multimeric polypeptides 
typically have an association constant for the preselected 
substrate generally greater than 10 s M*\ more usually 
25 greater than 10 s W 1 or 10 6 M"' and preferably greater than 


30 


10 7 M* 1 


Preferably the multimeric polypeptide produced 
according to the present invention is heterodimeric and is 
therefore normally comprised of two different polypeptide 
chains, which together assume a conformation having a 
binding affinity, or association constant for the pre- 
selected ligand that is different, preferably higher, than 
the affinity or association constant of either of the 
polypeptides alone, i.e., as monomers. In a particularly 
35 pr ferr d aspect, one or both of the different polypeptide 
chains is derived from the variable region f the light 
and heavy chains of an immunoglobulin. Typically, poly- 
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10 


15 


peptides co-prising the light (V,) and heavy (VJ variable 
regions are employed together for binding the preselected 

"'"A V or v t produced by the methods of the subject 
invention can be active in monomeric as well as multimeric 
forms, either bomomeric or heteromeric. preferably hetero- 
dimer-ic.^V (J? ^li?ana binding polypeptide produced 
by the present invention ^i^dvantageously-combiaed in 
fheterodime, (antibody molecule, to modulate the activity 
of either or to produce an activity unique to the het«ro- 
dimer. The individual ligand binding polypep tl des will b* 
referred to as V, and V t and the heterodimer will be 
referred to ae an antibody molecule. 

However, it should be understood that a v„ binding 
polypeptide may contain in addition to the V substan- 
tially .11 or a portion of the heavy chain constant 
«,il . V, binding polypeptide may contain, 
to the V l( substantially .11 or a portion of the light 
chain constant region. A heterodimer comprised of a V, 
20 binding polypeptide containing a portion of the heavy . 
" ^1 constant region and a V t binding containing substan- 
tially all of the light chain constant region !s termed a 
Fab fragment. The production of a Fab can be advantageous 
in some situations because th. additional constant region 
25 sequences contained in a Fab as compared to a F oould 
stabilise the V, and V t interaction. such stabilisation 
could cause the Fab to have higher affinity for antigen 
In addition the Fab is more commonly used xn the art and 
thus there are more commercial antibodies available to 
30 specifically recognize a Fab. „*,~ fl 
The individual V, and V L polypeptides may be produced 
in lengths egual or substantially equal to their naturally, 
occurring lengths. However, the individual V, and V, poly- 
peptides will g nerally have fewer than 125 amino acid 
35 residues, more usually f wer than about 120 amino acid 
r sidu s, while n rmally having greater than 60 amino acid 
residues, usually greater U»n about 95 amino acid 
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residues, more usually greater than about 100 amino acid 
residues. Preferably, the V„ will be fro* about 110 to 
about 125 amino acid residues in length while V L will from 
about 95 to about 115 amino acid residues in length. 

The amino acid residue sequences of the polypeptides 
will vary widely, depending upon the particular idiotype 
.involved. Usually, there will be at least two cysteines 
separated by from about 60 ^75-amino-acid-residues_and 


10 


20 


joined by a disulfide bond. The polypeptides produced by 
the subject invention will normally be substantial copies 
of idiotypes of the variable regions of the heavy and/or 
light chains of immunoglobulins, but in some situations a 
polypeptide may contain random mutations in amino acid 
residue sequences in order to advantageously improve the 

15 desired activity. 

In some situations, it is desireable to provide for 
covalent cross linking of the V„ and V L polypeptides, which 
can be accomplished by providing cysteine resides at the 
carboxyl termini. The polypeptide will normally be pre- 
pared free of the immunoglobulin constant regions, however 
a small portion of the J region may be included as a 
result of the advantageous selection of DNA synthesis 
primers. The D region will normally be included in the 

transcript of the V„. 

in other situations, it is desirable to provide a 
peptide linker to connect the V L and the V„ to form a 
single-chain antigen-binding protein comprised of a V H and 
a V t . This single-chain antigen-binding protein would be 
synthesized as a single protein chain. Such a single- 
chain antigen binding proteins have been described by Bird 
et al., science . 242:423-426 (1988). The design of 
suitable peptide linker regions is described in U.S. 
Patent No. 4,704,692 by Robert Landner. 

Such a peptide linker may be design d as part of the 
nucl ic acid sequences contained in the expression vector. 
The nucleic acid sequences coding for th peptide linker 
would be btwen the V, and V t DNA hoaologs and the 


25 
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reaction endonuclease sites used to cperat^ely link 
the V„ and V. DNA homologs to the expression vector. 

Such a peptide linker also Bay be coded for nucleic 
acid sequences that are part of the polynucleotide primers 
used to prepare the various gene libraries. The nucleic 
acid sequence coding for the peptide linger can be made up 
" nuclei* acids attached to one ot the priors or the 

OI ^^ 4 W »-for-the-peptide l inker ma y be 

nucleic acid sequence coding for the pep — 


nucleic acia Bequc**w^ ^ --=> - . . ^ . ^ . v _ 

aerived from nucleic acid fences that are attached to 
10 several polynucleotide priors used to create the gen. 

librB T^!cally the C terminus region of the V, and V t 
polypeptides will have a greater variety of the 
toaTthe H terminus and. based on *™"*""?% e 
15 c a„ be further modified to permit a variation of the 
normally occurring V. end V t chains. A synthetic 
polynucleotide and be employed by vary one or more amino 
in an hypervariable region. 

i. n rl' f^"" of " n ™ yrertgira _ 

According to one aspect ot the P"" n V lnVe ""°"' ! 
gene repertoire useful in the methods the present xnven- 
lon contains at least 10>, preferably at east 0 more 
preferably at least lo', and most preferably at least 10 
different consderved genes. Methods for evaul.ting the 

25 diversity of a repertoire of conserved genes are well 
known to one skilled in the art. 

various well known methods can be employed to produce 
a useful gene repertoire. For example, to prepare ^ a 
composition of nucleic acids containing a substantial. 

30 portion of the immunological gene repertoire a source of 

30 portion «,_ d/or v polypeptides is required, 

genes coding for the V, and/or v L pv-ir r _ , 

reeferably the source will be heterogeneous population of 
antibody producing cells, i^, .» lymphocytes 
preferably r arranged B cells such as those found in the 

3S circulati n or spleen of a vertebra* . t*"™* 1 * 
c lis are those in which immunoglobulin gene transloca- 
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tion, i.e.,. rearrangement, has occurred as evidenced by 
the presence in the cell of mRNA with the immunoglobulin 
gene V, D and J region transcripts adjacently located 
thereon.) 

in some cases, it is desirable to bias the repertoire 
for a preselected activity, such as by using as a source 
of nucleic acid cells (source cells) from vertebrates in 

^To^^f^rio^ii^ 


any uu«s wj. * w ** w ~ — ^ — 

response. For example, repeated immunization of a healthy 
10 animal prior to collecting rearranged B cells results in 
obtaining a repertoire enriched for genetic material 
producing a ligand binding polypeptide of high affinity. 
See, e.g. Mullinax et al., ?T™, Pat f ftc^t Sc&. (PSA) 
£2:8095-8099 (1990). Conversely, collecting rearranged B 
15 cells from a healthy animal whose immune system had not 
been recently challenged results in producing a repertoire 
that is not biased towards the production of high affinity 
V„ and/ or V L polypeptides. 

It should be noted the greater the genetic hetero- 
20 geneity of the population of ceils for which the nucleic 
acids are obtained, the greater the diversity of the 
immunological repertoire that will be made available for 
screening according to the method of the present inven- 
tion. Thus, cells from different individuals of different 
25 strains, races or species can be advantageously combined 
to increase the heterogeneity (diversity) of the 
repertoire. 

Thus, in one preferred embodiment, the source cells 
are obtained from a vertebrate, preferably a mammal, which 

30 has been immunized or partially immunized with an anti- 
genic ligand (antigen) against which activity is sought, 
i e., a preselected antigen. The immunization can be 
carried out conventionally. Antibody titer in the animal 
can be monitored to determine the stage of immunization 

35 desired, which stage corresponds to the amount f enrich^ 
*ent or biasing of the repertoire desir d. Partially 
immunized animals typically receive only one immunization 
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and cells are collected therefrom shortly after a response 
" detected. Fully immunized animals display a peak 
titer, which is achieved with one or more 
tions of the antigen into the host mammal, normally at 2 
, to 3 week intervals. Usually three to five days after the 
last challenge, the spleen is removed and 

iep=rtoire^of_the_spleenocytes, about 90* of which are 

rearranged B cells, irisolated^i^tai^ard-procedures 
see, -iirrrnt -^r™ 1 !. m Trim'" Rusubel et 

io al.. eds., John Wiley t sons, ire. 

Kucl.ic acids coding for V, and v t polypeptides can he 
aerived from cells producing IgA, 19°. « 
most preferably from XgH and Producing cells. 

Methods for preparing fragments of genomic DNA f rom 
» which immunoglobulin variable region genes «n clon °* 
as a diverse population are well known in the art See 
£ or example Herrmann et al., * * *** Tn TOW**. , 
ill, -CiM7>i Frischauf, U r ^r Tn ^f.YBOl. , in.wo » 
(19 „7>; Friechauf, „r t tl0^ Tn fin«Y»rt.. . 
2 6 1M7), and DiLella et .1., BS - tl^ Ttl TMWPU . 

2U (1987) . (The teachings of the references cited herein 
are hereby incorporated by reference.) ,„„,„„, 
The desired gene repertoire can be isolated from 
either genomic material containing the gene expressing the 
2S variable region or the messenger SKA <™J***££1. 
sen ts a transcript of the variable region. The difficulty 
in using the genomic DNA from other than non-rearranged B 
ymphocytes is in Juxtaposing the seguences coding for the 
viable region, where the seguences are separated by 
30 intervening regions. The DHA fragment (s) containing the 
"roper variable regions must be isolated, the intervening 
regions excised, and the variable regions then *» 
th proper order end in the proper orientation. For the 
part, this will be difficult, so that the '"^native 
„ T hnigu. employing rearran, d B cells will 

of choic because the V, D and J immunoglobulin gene 


SUBSTITUTE SHEET 


WO 91/16427 


PCI7US91/02910 


42 


regions have translocated to become adjacent, so that the 
sequence is continuous for the variable regions. 

Where mRNA is utilized the cells will be lysed under 
KNase inhibiting conditions. In one embodiment, the first 
5 step is to isolate the total cellular mRNA by hybridiza- 
tion to an oligo-dT cellulose column. The presence of 
mRNAs coding for the heavy and/or light chain polypeptides 

can then be assayed by hybridization vith DNA-single 

strands of the appropriate genes. Conveniently, the 

10 sequences coding for the constant portion of the V H and V L 
can be used as polynucleotide probes, which sequences can 
be obtained from available sources. See for example, 

Early and Hood, Genetic Engineering, Setlow and 

Hollaender, eds.. Vol. 3, Plenum Publishing Corporation, 

15 New York, (1981), pages 157-188? and Kabat et al., 
of 7 rinT] ^^», rnterest. National Institutes 
of Health, Bethesda, MD, (1987). Exemplary methods for 
producing V B and V L gene repertoires are described in PCT 
Application No. PCT/US 90/02836 (International Publication 

20 NO. WO 90/14430) . 

In preferred embodiments, the preparation containing 
the total cellular mRNA is first enriched for the presence 
of V, and/or V L coding mRNA. Enrichment is typically 
accomplished by subjecting the total mRNA preparation or 

25 partially purified mRNA product thereof to a primer 
extension reaction employing a polynucleotide synthesis 
primer of the present invention. 

According to another aspect of the present invention, 
a gene repertoire may be generated from one or a few 
30 nucleotide sequences by replicating those sequences under 
mutagenesis conditions so that a plurality of different 
nucleotide sequences or genes may be generated. Suitable 
mutagenesis conditions are known to those skilled in the 
art. 
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2. 


2 ^H«n Of Tnl-.TI^I-^^ »H»a*» 
The tirB -pol^leotid^^sused hJ^^ reference 

to priors, probes and nucleio acid ^^"J^Tl 
to be synthesized by primer extension is defined as a 
roXecuu'Urised of two or more ^^^^ 
ribonucleotides, preferably more than 3. 
^l^pend-on-many-fac^^^ 


30 


35 


ultimate conditions of use. 

The term "primer" as used herein refers to a poly- 
nucleotide whether purified from a nucleic acid 
"eviction digest or produced synthetically, which »s 
"pan e of acting as a point of initiation 
vhen placed under conditions in which synthesis of a 
£Imer tension product which is complementary to a 
nucleic acid strand is induced, i.e.. in the presence of 
nucleotides and an agent for polymerize . such as DKX 
pclymer.se, reverse transcriptase and the IxXe, and at a 
suitable temperature and pH. The primer is preferably 
"ngle stranled for maximum efficiency, hut may alterna- 
tively he stranded. If double stranded, the primer is 
first treated to separate its strands before being used 
""repare extension products. Preferably, the primers 
I polydeoxyribonucleotide. The primer must be suffi- 
cient* loX to prime the synthesis of extension products 
I- Ie presence of the agents for polymerisation. The 
exact lengths of the primers will depend on 
including temperature and the source ■ °*J™~- ™ 
example, depending on the complexity of «.e target 
exa ^ ' , ... - „_ <ffi e>r tvoically contains 15 to 

sequence, a polynucleotide primer typxcax y 

25 or more nucleotides, although it can contain fewer 
betides. short primer molecules generally retire 
cooler temperatures to form sufficiently stable hybrid 

complexes with template. . 

The prim rs used herein are select d to be 
..substantially- complementary to the different strands f 
ea ch sp cific s quence to be synth sized or amplified. 
ZTs me'ans Lt the primer must be sufficiently c mpl - 
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mentary to nonrandomly hybridize with its respective 
template strand. Therefore, the primer sequence need not 
reflect the exact sequence of the template. For example, 
a non-complementary nucleotide fragment can be attached to 
the 5« end of the primer, with the remainder of the primer 
sequence being substantially complementary to the strand, 
such noncomplementary fragments typically code for an 
endonuclease r estri ction-site^Mternatively,-noncomple^ 


enaonucxea&e iwwi^w*v.. 

Bentary bases or longer sequences can be interspersed into 
10 the primer, provided the primer sequence has sufficient 
complementary with the sequence of the strand to be syn- 
thesized to amplified to non-randomly hybridize therewith 
and thereby form an extension product under polynucleotide 
synthesizing conditions. 
15 Primers of the present invention may also contain a 

DNA-dependent RNA polymerase promoter sequence or its 
complement. See for example, Krieg et al., Nucleic Acids 
Eesearch, 12:7057-70 (1984); Studier et al., J, 
BiBU, 189:113-130 (1986); and NnlPnilar CI on jpq: A 
20 ■T^^nrvMa^' — * Edition, Maniatis et al., eds. , 
Cold Spring Harbor, NY (1989) . 

When a primer containing a DNA-dependent RNA poly- 
merase promoter is used, the primer is hybridized to the 
polynucleotide strand to be amplified and the second 
25 polynucleotide strand of the DNA-dependent RNA polymerase 
promoter is completed using an inducing agent such as 
coll DNA polymerase I, or the Klenow fragment of E^fisU 
DNA polymerase. The starting polynucleotide is amplified 
by alternating between the production of an RNA poly- 
30 nucleotide and DNA piynucleotide . 

Priaers may also contain a template sequence or 
replication initiation site for a RNA-directed polymerase. 
Typical RNA-directed RNA polymerase include the QB repli- 
case described by Lizardi et al. plotechnologv , 611197- 
35 1202 (1988) . RNA-directed polymerases produce larg 
numebrs f RNA strands from a small number of template RNA 
strands that contain a template sequence r replication 
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initiation site. These polymerases typically give a one 
B illion-f old amplification of the template strand, as has 
Len described by Kramer et al., SUJUU^UU. 8917819 " 
736 (1974) . 

The polynucleotide primers can be prepared using any 
suitable method, such as, for example, the phosphotriester 
on-phosphodiester_meU^_^ee_Narang et al., Meth^ 


Brown et al., Hf til , Ttormvl , . «'"> 9 ' < 1979 > • 
10 The choice of a primer's nucleotide sequence depends 

on f actors such as the distance on the nucleic acid frc» 
the region coding for the desired receptor, its hybrid- 
isation sit. on the nucleic acid relative to any second 
pri-er to oe used, the nusber of genes in t*e repertoire 
15 it is to hybridize to, and the like. 

(a) r ~< r -~ Prodnrlm V„ and V,, . W ^omoj-ogs , 

V H and V L gene repertoires can be separately prepared 
prior to their use in the methods of the present Inven- 
tion. Repertoire preparation is typically done by prxmer 

20 extension (or ether in^itro amplif icaiton method,, 
preferably by primer extension in a PC* format. 

For example, to produce V H -coding DNA homologs by 
primer extension, the nucleotide sequence of a V*™ ** 
selected to hybridize with a plurality of xmmunoglobulin 

25 heavy chain genes at a site substantially adjacent to the 
V -coding region so that a nucleotide sequence codxng for 
Afunctional (capable of finding, polypeptide is obtained. 
To hybridize to a plurality of different V H -codxng nuclexc 
acid strands, the primer must be a substantial complement 

30 of a nucleotide sequence conserved among the different 
strands. Such sites include nucleotide sequences in the 
constant region, any of the variable region framevork 
r gions, preferably the third framework region, leader 
regi n, promoter regi n, J region and the like. 

If the v„-coding and V L -coding DNA hom logs are to be 
produced by P lymerase chain reaction (PCR) amplification. 
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two primers must be used for each coding strand of nucleic 
acid to be amplified. The first primer becomes part of 
the nonsense (minus or complimentary) strand and hybrid- 
izes to a nucleotide sequence conserved among V H (plus) 
5 strands within the repertoire. To produce V H coding DNA 
homologs, first primers are therefore chosen to 
~to ^e,^complementary to) conserved regions within the 

j region, CHI region, hinge-^tf^C^region^or-C^ 

region of immunoglobulin genes and the like.. To produce 
10 a v L coding DNA homolog, first primers are chosen to 
hybridize with (i.e. be complementary to) a conserved 
region with the J region or constant region of immuno- 
globulin light chain genes and the like. Second prxmers 
become part of the coding (plus) strand and hybridize to 
15 a nucleotide sequence conserved among minus strands. To 
produce the V H -coding DNA homologs, second primers are 
therefore chosen to hybridize with a conserved nucleotide 
sequence at the 5 • end of the V H -coding immunoglobulin gene 
such as in that area coding for the leader or first frame- 
20 work region. It snould be noted that in the amplification 
of both V„- and V L -coding DNA homologs, the conserved 5 
nucleotide sequence of the second primer can be comple- 
mentary to a sequence exogenously added using terminal 
deoxynucleotidyl transferase as described by Loh et al., 
25 Science 2^:217-220 (1989). One or both of the first and 
second primers can contain a nucleotide sequence defining 
an endonuclease recognition site. The site can be heter- 
ologous to the immunoglobulin gene being amplified and 

*. ~~ noav- the 5' end of the primer, 
typically appears at or near the » 

30 (b) Tpgirie arifl outside Primers. 

in one embodiment, the present invention utilizes a. 
set of polynucleotides that form inside primers comprised 
of an upstream inside primer and a downstream inside 
primer. Each of the inside primers has a priming region 

35 located at the 3- -terminus of the prim r. The priming 
rgion is typically the 3 '-most (3 • -terminal) 15 t 30 
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nucleotide bases. The 3- -terminal priming portion of each 
inside primer is capable of acting as a pnmer to cataXyze 
nucleic acid synthesis, i.e., initiate a primer extension 
paction off its 3- terminus. One or both of the inside 
primers is further characterized by the presence of a 5 - 
terminal (5' -most) non-priming portion, i.e., a region 
tnat does not participate in hybridization to repertoire 


10 


15 


in fusion PGR, each inside primer works in 
cognation with an outside primer to amplify a target 
nucleic acid sequence. The choice of PGR primer pairs for 
•use in fusion PCR as described herein is governed by the 
sa ,ne considerations as previously discussed for choosing 
PCR primer pairs useful in producing gen 
That is , the primers have a nucleotide sequence that is 
complementary to a sequence conserved in the -pertoire 
Useful V t and V H inside priming sequences are shown in 
Tables 1 and 2, respectively, below. 


20 


25 


30 


35 


Table 1 . 

» » Porti , n- - T "^ d ° V.^rlmers 

Seq. 
jd, flo- 

5' GTGATGACCCACTCTCC 3« 
GTGATGACCCAGTCTCCA 3 
GTTGTGACTCAGGAATCT 3 
GTGTTGACGCAGCCGCCC 3 
GTGCTCACCCAGTCTCCA 3 
CAGATGACCCAGTCTCCA 3 
GTGATGACCCAGACTCCA 3 
GTGATGACCCAGTCTCCA 3 
TTGATGACCCAAACTCAA 3 
_ GTGATAACCCAGGATGAA 3 
Hucleotldes sequences 1-10 .r unique 5- primers for 
the amplification of kappa light chain variable 
regions. 


(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
<8) 
(9) 
(10) 
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-» « Prlmin 

Q PC 


Scq. 



Tr!. NO. 


5 

(II) 1 

5' 


(12) 2 

5* 


(13) 3 

5' 


(14)* 

5* 


(15) 5 

5« 

10 

(16)* 

5« 


(17) 7 

5' 


(18) 8 

5' 


(19)' 

5« 


(20) 10 

5« 

15 

(21) 11 

5« 


ortions 


TGGGGTTTTGAGCTC 3 


20 


25 


30 


35 


TGGAATGGGCACATGCAG 3' ~ " . 
TTATCATTTACCCGGAGA 3 ' 
AACGGTAACAGTGGTGCCTTGGCCCCA 3« 

ACAATCCCTGGGCACAAT 3 ' 
CACCTTGGTGCTGCTGGC 3' 
ACAACCACAATCCCTGGGCACAATTTT 3 • 

ACAATCCCTGGGCACAAT 3' 
GAGTTCACTAGTTGGGCACGGTGGGCA 3 • 

1 unique 3- primer for human IgGl, 2, 3, and 4 F.2d. 

2 Unique 3' primer for human ^ amplification. 

3 3 « primer for amplifying human heavy chain variable 

regions, . ^ _ M 

3 ." primer for amplifying the Fd region of mouse IgM. 
5 3 « primer located in the CHS region of human IgGl to 
amplify the entire heavy chain. 
Unique 3 • primer for amplification of mouse F v . 
Unique 3 • primer for amplification of mouse IgGl Fd. 
Unique 3- primer for amplification of VH including 
part of the mouse gamma 1 first constant region. 
Unique 3. primer for amplification of VH including 
part of mouse gamma 1 first constant region and hinge 

region. . - 

3 i primer for amplifying mouse Fd including part of 
the mouse IgG first constant region and part of the 

hinge region. * 
3 . primer for amplifying human IgGl Fd including part 
of the human IgG first constant region and part f 
the hinge region including the two cysteines ; which 
create the disulfide bridge for producing Fab'2 (the 
primer corresponds to Kabat .number ,24 1QQ to 247). 


10 


11 
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A preferred set of inside primers used herein has 
primers with complementary 5-terminal non-priming 
regions, the coMpie.ent.ry strands of which «re capable o f 
hybridizing to each other to for* a duplex wxth 3- over- 
hangs. L duplex encodes all or part of a double 
branded cistrohic bridge. That is, if 1*e ^erhangs 
of the duplex are filled in with complementary " " e 

to define a double st r^de^-extendin^ -^«-^ 


to define a aouoxe 4.™^,,= 

terminus of one of the inside primers to the 3- -terminus 
10 of^he other of the inside primers, that double stranded 
1 segment 'ferns a eeguenCe of nucleotides 
tively links the upstream and downstream \ . 

polycistronic expression. , Thus, while each of the inside 
frimers in a set contains only a portion ofthe sequence 
« information necessary to form the double ^tranded 
cistronic bridge, the two inside primers i^ oomb,nation 

encode both the plus end minus strands of all or part of 

? " b For 9 «ample, one inside upstream primer can have^a 
20 sequence that forms a portion of the plus strand of the 
br«ge. and the other inside primer: eneode* 
through complementarity, of the downstream portion of the 

, plus s ^^ re£erre(J ^boa^ent, the plus strand of the 
« cistronic bridge contains, in the transnational reading 
frame and from an upstream position to a downstream 
position, sequences coding for <i> at least one stop 
codon, preferably two, in the same reading ^ frame as the 
upstream cistron, (ii, a ribosome binding site, and (iii) 
30 a polypeptiue leader, the translation initiation codon of 
which is in the same reading frame as the downstream 
"Ltrcn. the stop codon is present to 
tion of the upstream cistron. The ribosome binding site 
is present to initiate translation of the downstream 
35 cistron from the p lycistronlc mRNX. 

The predicted amino acid residue sequences of two 
pelB gen. pr duct variants from rnrinin ffirotova are shown 
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in Table 3. Lei, et al., SHEE*. Amino Acid residue 
sequences for other leaders fro* p. coU useful in this 
invention are also listed in Table 3. Oliver, In 
Neidhart, F. C. (ed.), Tumeric*!* coU and SjOjuonella 
o ypMTmirium . American Society for Microbiology, 
Washington, D. C. , 1:56-69 (1987). These regions for the 
-v^vv^chain are contained in the Bodified^IimunoZAP H 
expression vector. Hullinax, et al./ Proc, 
>M . . USA . 87:8095-8099 (1990). 


-Wfi«— Acad^ 


10 


15 


20 


j.f.ader Sequences 
Seq. 


JA. No. 
(22) 


(23) 


(24) 


(25) 


(26) 


25 (27) 


(28) 


30 (29) 


TYPe 
pelB 1 

pelB 2 

MalE 5 


OmpF 5 


PhoA 3 


Bla J 


LamB J 


LPP 


^mliiQ AcJ H Pftsidup ^Amience 
MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeuLeu 
LeuLeuAlaAlaGlnProAlaGlnProAlaMetAla 
MetLysSerLeulieThrProIleAlaAlaGlyLeuLeu 
LeuAlaPheSerGlnTyrSerLeuAla 
MetLysIleLysThrGlyAlaArglleLeuAlaLeuSer 
AlaLeuThrThrMetMetPheSerAlaSerAlaLeuAla 

Lyslle 

MetMetLysArgAsnlleLeuAlaVallleValProAla 

LeuLeuValAlaGlyThrAlaAsnAlaAlaGlu 

MetLysGlnSerThrlleAlaLeuAlaLeuLeuProLeu 

LeuPheThrProValThrLysAlaArgThr 

MetSerlleGlnHisPheArgValAlaLeuIleProPhe 

PheAlaAlaPheCysLeuProValPheAlaHisPro 

MetMetlleThrLexiArgLysLeuProLeuAlaValAla 

ValAlaAlaGlyValMetSerAlaGlnAlaMetAlaVal 

Asp 

MetLysAlaThrLysLeuValLeuGlyAlaVallleLeu 
GlySerThrLeuLeuAlaGlyCysSer 


35 


1 

2 
3 


pelB from Prwlnia p afotovcra gene 
pelB from f*r<n <n ^rotovora EC 16 gen 
leader sequences f rom . Ej_j£21A 
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To achieve high levels of gene expression an E_S2ll. 
« is necessary to use not only strong promoters gener- 
ate large quantities o£ mRNA/ but also riboso„e binding 
sites to ensure that the *» is efficiently translated. 

5 in fc-aaU. ribos0M binain9 * ite 1 S •? f 
WoHoTon (AUG) and a sequence 3- £j 
seated 3 U nucleotides upstrea* from the initiation 
cLon IShine et . IT^^-^^-^SH The 

sequence. X^GG*. which is called the <*°> 

, ••-« 3« end of E. coll 16s 

10 seouence, is complementary to the 3 

10 S- Biding of the ribosome to mRNA and the 

3- end of the mRNA can he affected by several factors 
(i, The degree of complementarity between the SD 
sequence and 3 ' end of the 16S tRNA. 

(ii) The spacing and possibly the DNA sequence lying 
between the SD sequence and the AUG [Roberts etal ., 
tt~T 1 r > ^« tISA . 76:760 (1979A) ; Roberts et al . , 

al..' Science, 209:1428 (1980); and Guarente et al.. Cell, 
ax., h . aC hieved by measuring the 

20 20:543 (1980) .] Optimization is achieve* y _ 

level of expression of genes in plasmids in whic* this 
spacing is systematically altered. Comparison of differ^ 
ent mRNAs .hows that there are statistically preferred 
sequences from positions -20 to + 13 (where the A of the 

25 Aufis position 0, [Gold et al., Mitm rt«rrti*l„ 
35:365 (1981) ] . leader sequences have been shown to 
infiuence translation dramatically (Roberts et al. 1979 a, 

b ^ii) The nucleotide sequence following the AUG, 
30 Which affects ribosome binding [Taniguchi et al., CL-MeX,. 

BioU, 118:533 (1978)]. . _ 

— x~*~Air*n sites are shown in Table 4 

Useful ribosome binding sires 

below. 
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p< raping Bind ing Sites* 

Seq. 
•xa. Ko. 

(30) 5. AAUCUUGGAGGCUUUUUUAUGGUUCGUUCU 

2 * (31 ) 5. UA ACE&AGSAUGAAAUGCA2gOCDAAGACA 

3 * (32) 5« UCCTAG^AfiS2DUGACCD£2£CGAGCUUUU 

4 # (33)~^ ? . ^ CT T^T^yr.GKGGUUGU&gGGAACAACGC__ 

* sequences of initiation regions for protein . 
10 synthesis in four phage mRNA molecules are underlined. 
AUG « initiation codon (double underlined) 

1. «= Phage $X174 gene-A protein 

2. ■ Phage Qfi replicase 

3. = Phage R17 gene-A protein 

15 4. - Phage lambda gene-cro protein 

It is preferred that the complementary (overlapping) 
region of the inside primers and the priming portion of 
the inside primers have about the same denaturatxon 
temperature, Td. The Td of a sequence can be estimated by 
20 the following formula: Td - 4(C + G) + 2(A + T), where C, G. 
A and T represent the respective number of cytosxne, 
guanine, adenine and thymine bases in the sequence. A Td 
for the above-identified hybridizing region of about 45- 
55-C, preferably about 50'C, is preferred. Typically, 
25 overlapping regions in the range of about 15 to 20 
nucleotides works well in conjunction with the primxng 
regions in the range of 15-30 nucleotides. 

The set of outside primers forms the termini of the 
dicistronic DNA molecule. The set of .outside primers 
30 comprises an upstream outside primer and a downstream 
outside primer. The outside primers each comprxse a 3 - 
terminal priming portion, and preferably a portion that 
defines an endonuclease restriction site. When present, 
th restriction site-defining portion is typically located 
35 in a 5 '-terminal non-priming portion of th outside 
primer. The restriction site defined by the upstream 
outside primer is typically chosen to be one recognized by 
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a restriction enzyme that does not recognize the restric- 
tion site defined by the downstream outside primer, the 
objective being to be able to produce a dicistronic DNA 
having cohesive termini that are non-cbmpelementary to 
each other and thus allow directional insertion into a 
vector. 

Useful outside primer sequences are shown in Tables 

5 and"~6~below; • . _____ 


yable 5 
10 Q"f-s-d_ V „ Primers 
Seg. 


13, ?*o. 

(34) 1 5 » AGGTCCAGCTGCTCGAGTCTGG3 

(35) 5 
15 (36) 

(37) 5' 

(38) 5 

(39) 5 ' 

(40) 5' 

20 (41) 5 * 

(42) 2 5V«— " 

( 4 3 j 5 • AGGTGCAGCTGCTCGAGTCGGG3 

(44) 
(45) 


5'; 


AGGTCCAGCTGCTCGAGTCTGG3 ' 
AGGTCCAGCTGCTCGAGTCAGG3 • 
AGGTCCAGCTTCTCGAGTCTGG3 1 
AGGTCCAGCTTCTCGAGTCAGG3 « 
AGGTCCAACTGCTCGAGTCTGG3 ' 
i AGGTCCAACTGCTCGAGTCAGG3 ' 
AGGTCCAACTTCTCGAGTCTGG3 ' 
AGGTCCAACTTCTCGAGTCAGG3 1 
AGGTGCAGCTGCTCGAGTCTGG3 


s - AGGTGCAACTGCTCGAGTCTGG3 • 
5 * AGGTGCAACTGCTCGAGTCGGG3 ' 
Nucleotide sequences 21-28 are unique 5' primers for 
the amplification of mouse V H genes. 
? Nucleotide sequences 29-32 are unique 5' primers for 
amplification of nucleic acids coding for human 
variable regions. 

30 yab^e 6 

mil-side V, primers 

Seq. 

„■ 

(46) 1 5« ACGTCTAGATTCCACCTTGGTCCC 3' 

35 (47) 2 5 • TCCraCTAGATTACTAACACTCTCCCCTGTTGAA 3 ' 
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20 


(48) 3 5' GCATTCTAGACTATTAACATTCTGTAGGGGC 3' 

(49 )*■ 5' GCAGCATTCTAGAGTTTCAGCTCCAGCTTGCC 3 • 

( 5 0 ) 5 5 ' CCGCCGTCTAGAACACTCATTCCTGTTGAAGCT 3 • 

(51) 6 5« CCGCCGTCTAGAACATTCTGCAGGAGACAGACT 3' 

(52) 7 5' GCGCCGTCTAGAATTAACACTCATTCCTGTTGAA 3' 

(53) 8 5' GCCGCTCTAGAACACTCATTCCTGTTGAA 3' 

(54) 9 5' TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA 3' 
~JS5) W 5' GCA TTCTAGACTATTATGAACATTCTGTAGGGGC-3J, 


25 


3' primer for amplifying human kappa chain variable 

10 regions. 

2 3« primer in human kappa light chain constant region. 

3 3 , primer in human lambda light chain constant 
region. 

* Unique 3 • primer for amplification of kappa light 
15 chain variable regions. 

5 Unique 3« primer for mouse kappa light chain 
amplification including the constant region. 

* Unique 3' primer for mouse lambda light chain 
amplification including the constant region. 

7 unique 3' primer for amplification of kappa light 
chain. 

« unique 3 • primer for amplification of mouse kappa 
light chain. 

9 Unique 3« primer for kappa V L amplification. 

Unique 3' primer for human, mouse and rabbit lambda 
V L amplification. 


30 


10 


3. Preparing a Gen e library 

The strategy used for cloning, i.e., substantially 
reproducing the V H and/or V L genes contained within the 
isolated repertoire will depend, as is well known in the 
art, on the type, complexity, and purity of the nucleic 
acids making up the repertoire. Other factors include 
wh ther or not the genes ar contained in one or a 
plurality of repertoires or populations and whether or n t 
35 they are to be amplified and/or mutagnized. 
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a. rrrpnrin-T "„ v , ? Varies 

in one strategy, the object is to clone the V H - and/or 
V -coding genes from a repertoire comprised of polynucleo- 
tide coding strands, such as mRNA and/ or the sense strand 
of genomic DNA. If the repertoire is in the form of 
double stranded genomic DNA, it is usually first dena- 
tured, typically by melting, into single strands. The 
repertoire i ^-TO^cted-to-a-first-primer-extension 


reaction by treating (contacting) the repertoire with a 
10 first polynucleotide synthesis primer having a preselected 
nucleotide sequence. The first primer is capable of ini- 
tiating the first primer extension reaction by hybridizing 
to a nucleotide sequence, preferably at least about 10 
nucleotides in length and more preferably at least about 
15 20 nucleotides in length, conserved within the repertoire. 
The first primer is sometimes referred to herein as the 
"sense primer- because it hybridizes to the coding or 
sense strand of a nucleic acid. In addition, the second 
primer is sometimes referred to herein as the "anti-sense 
20 primer" because it hybridizes to a non-coding or anti- 
sense strand to a nucleic acid, i.e., a strand 
complementary to a coding strand. 

The PGR reaction is performed by mixing the PGR pair, 
preferably a predetermined amount thereof, with the 
25 nucleic acids of the repertoire, preferably a predeter- 
mined amount thereof, in a PGR buffer to form a first PCR 
admixture. The admixture is maintained under polynucleo- 
tide synthesizing conditions for a time period, which is 
typically predetermined, sufficient for the formation of 
30 a PCR reaction product, thereby producing a gene library 
containing a plurality of different V H - and/or V t -coding 

DNA homologs. ' 

A plurality of first primer and/or a plurality of 
s cond primers can be used in each amplification, e.g., 
35 one species f first primer can be paired with a number of 
second primers to form several different primer pairs. 
Alt matively, an individual pair of first and sec nd 
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primers can be used. In any case, the amplification 
products of amplifications using the same or different 
combinations of first and second primers can be combined 
to increase the diversity of the gene library. 
5 In another strategy, the object is to clone the V H - 

and/or V L -coding gene from a repertoire by providing a 
polynucleotide complement of the repertoire, such as the 

anti^sense^strand,of^genomi c dsDNA or the polynucleotide 

produced by subjecting raRNA to a reverse transcriptase 
10 reaction. Methods for producing such complements are well 
known in the art. The complement is subjected to a primer 
extension reaction similar to the above-described second 
primer extension reaction, i.e., a primer extension 
reaction using a polynucleotide synthesis primer capable 
15 to hybridizing to a nucleotide sequence conserved among a 
plurality of different V„-coding gene complements. 

The primer extension reaction is performed using any 
suitable method. Generally it occurs in a buffered aque- 
ous solution, preferably at a pH of 7-9, most preferably 
about 8. Preferably, a molar excess (for genomic nucleic 
acid, usually about 10*: 1 primer: template) of the primer is 
admixed to the buffer containing the template strand. A 
large molar excess is preferred to improve the efficiency 
of the process. 

25 The deoxyribonucleotide triphosphates dATP, dCTP, 

dGTP, and dTTP are also admixed to the primer extension 
(polynucleotide synthesis) reaction admixture in adequate 
amounts and the resulting solution is heated to about 
90'C-100«C for about 1 to 10 minutes, preferably from 1 
to 4 minutes. After this heating period the solution is 
allowed to cool to room temperature, which is preferable 
for primer hybridization. To the cooled mixture is added 
an appropriate agent for inducing or catalyzing the primer 
ext nsion reaction, and the reaction is allowed to occur 
under conditions known in the art. The synth sis r action 
nay occur at from room temp rature up to a temp rature 
above which the inducing agent no longer functions effi- 


20 


30 


35 
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• t iv Thus for example, if DNA polymerase is used as 
^ ZZ. - mature is g enerally no greater 

than about 40-c. anv compound or system 

The inducing agent -* * « of priTCr 

which wilX function to aocomplish ite *» 

tension >^ ^ ^ - 

roerase I f Klenow ir y M lvmerases , reverse 


„erase Klenow r^^ --— , ges , MW MW — 

DHA polymerase. ot*er_ inclua ing heat-stable 
0 transcriptase, ana othe, ^ of the nucleo- 

enzy.es, which Will facilitate e extension 

"ToT srr ssl^t- — — « 

EST ™- the synthesis willbe^i tiat - - - 
L5 end of each primer and terminates. 
al ong the template strand ^ ^ Bay be 

producing molecules of -"^J^ synt hesis at the 
inducing ^ts ^owever which ^ ^ _ 

5« end and proceed in tne aww 
20 process as described above. . compound or system 

The inducing agent ^esis of t» 

which will function to enzymes. In 

prl „er extension ^^'^^Lt may\e a DNA-. 
preferred embodiments, ^'J^V polymerase, T3 RNA 
25 aependent BH^ymerase suoh as^ ^ 

polymerase or^ SP (W* J ^^tide . The high turn 
produce a complementary RNA P°«f t he starting 

overrate of the ™ Berlin et al . , 

pol y„ucleotide as has ^ d "^^^ press . New 

jj^jgjjzyjies, ed. P. Boyer, ^ polymerase is 

yor, ,„*>. ^/^u^d in" ^e Nucleotide 
that mutations can be cDSA vitt one or more 

synthesis by replacing a portion of ^ 

Mt agenic -i*-** « 

35 transcribing the partially m ^ lfiJ£lsis . 


overrate 
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bas ed on transcription have been described by Gingeras et 

" in r- 1 1- * 0^.fl e_ t" 

^^ ism , PP. 245-252. Academic Press, Inc., San 

Diego, CA (1990) . DNA -a ep endent RNA 

If the inducing agent is a u»» f 
oolvmerase and therefore incorporates ribonucXeotide 
S^osPnates. sufficient counts of ATP, CTP GTP and «TP 
^dsLdtothe -primer-extensio^reacti o, 
the resulting solution is treated as described above. 

The newly synthesized strand and its complementary 
nucleic add strand fori, a double-stranded molecule which 
can be used in the succeeding steps of the process. . 

The first and/or second primer extension reaction 
discussed above can advantageously be used to incorporate 
tntfthe multimeric polypeptide a preselected epitope 
u "eful in ^ologically detecting and/or i-^g a 
m"timeric polypeptide. This is accomplished by utiUzing 
a first and/or second polynucleotide syntheses primer or 
Session vector to incorporate a predetermined amino 
tcirresidue sequence into the amino acid residue sequence 

of the receptor. _ ^ homologs for 

After producing V,- and/or v t cot* i 
a plurality of different v.- and/or v t -coding genes within 
Z repertoire, the homologs are typically amplified 
, while «L v„ and/or feeding DNA homologs can be amplified 
^classic "technics such as inco rpo ration o - aut^ 

ampxia-y w* *««5t=»rtina them into a 

asc chain reaction (PGR) prior to inserting th 
_ r In fact, in preferred strategies, the first 

° ^ r'seco:, primer extension reactions use, to proa.ce 
^ gene library are the first and second primer extension 
reactions in a polymerase chain reaction. 

PGR is typically carried out by cycling i.e., 

5 simultaneously performing in one admixture, 

described first and s cond primer extension r actions, 
tlcT^cle c mprising polynucleotide synthesis followed by 
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library ana thereby V ^ be notea ^ 

^ 1 r SeXf t ^-^r"-^ in to 
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the mutation inducting variations described in the above 
referenced U.S. Patent No. 4,683,195, other mutation 
inducing. PCR variations can be employed. For example, the 
PGR reaction admixture, i.e., the combined first and 
5 second primer extension reaction admixtures, can be formed 
with different amounts of one or more of the nucleotides 
to be incorporated into the extension product, Under such 

tide substitutions within the extension product as a 
10 result of the scarcity of a particular base. Similarly, 
approximately equal molar amounts of the nucleotides can 
be incorporated into the initial PGR reaction admixture in 
an amount to efficiently perform X number of cycles, and 
then cycling the admixture through a number of cycles in 
15, excess of X, such as, for instance, 2X. Alternatively, 
nutations can be induced during the PCR reaction by incor- 
porating into the reaction admixture nucleotide deriva- 
tives such as inosine, not normally found in the nucleic 
acids of the repertoire being amplified. During subse- 
20 guent in vivo amplification, the nucleotide derivative 
wiU be replaced with a substitute nucleotide thereby 
inducting a point mutation. 

b p^rmr-lna a "^-^^ rm molecule library 
in one embodiment, a library of dicistronic DNA 
25 molecules containing upstream and downstream cistrons 
• operatives linked by a cistronic bridge can be produced 
by the following steps: 

(a) Subjecting a repertoire of first polypeptide 
genes (e.g., V H -coding genes) , to PCR amplification using 

30 first outside and first inside primers, i.e., a first PCR 
primer pair, to form a first primary PCR product. 

(b) subjecting a repertoire of second polypeptide 
genes (e.g., V t -coding genes) to PCR amplification using 
second outside and second inside pirmers, i.e., a second 

35 PCR primer pair, to form a second primary PCR product. 
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(c) Hybridizing the first and second primary PCR 
products to for* internally (self) primed duplexes, i.e., 
duplexes having 3 • -hybridized and 5 • -overhanging termini. 

(d) Subjecting the internally-primed duplexes to 
priffi er extension reaction conditions to form double 
stranded duplexes having substantially blunt, P^erably 
blunt, termini and a dicistronic strand containing the 
upstream and downstream cistrons TOKeo-by a^cistronic 
Tioge encoded by the inside primers. By "substantially 

is -ant having no more tban about one or two 
overhanging nucleotides. (Substantially blunt double 
stranded DNA is sometimes produced by primer overextension 
b y Tag polymerase, usually by the addition of one or two 
terminal adenine residues.) 

The v„- and v t -coding gene repertoires, are comprised 
of polynucleotide coding strands, such as »» and/or the 
sense strand or genomic DMA. If the repertoire is in the 
fon . of double stranded genomic DBA, it is usually first 
denatured, typically by melting, into single strands. A 
repertoire is subjected to a PCR reaction as described in 

Section 3a hereinabove. 

in preferred embodiments the ratio of gene molecules 


25 


30 


35 


and their respective primers is as follows: about 1 x 10 
V, gene molecules to about 1 x 10« outside V H gene 
t "o about 1 x 10' outside V, primer molecules, about 1 x 10 
V gene molecules, to about 1 X lo' inside V„ gene primer 
lolecules, about 1. x 10* V. gene molecules to about 1 x^0 
outside V t gene primer molecules, about 1 x 10 V gene 
molecules to about 1 x lo' V t gene primer molecules In 
more preferred embodiments, 10* outside V„ gene primer 
molecules and 10 s inside V, gene primer molecules are used 
for every V ( gene molecule present in the PCR admixture 
Similarly, 10< outside V t gene primer molecules and 10 V 
! ne mol cule present in the PCR admixture. Thus there 
Is typically a 10 fold molar excess of outside primer to 
insid primer. 
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in the fusion PCR reaction, the gene repertoires are 
admixed with outside and inside primers, the outside 
primers being present in excess relative to the inside 
primers. The initial PCR thenaocycles produce intermedi- 
ate products having complementary termini from each of the 
first and second gene repertoires. That is, the end of 
one strand from one primary PCR product is capable of 
-hybridizing- vith-the-complementary—end_f rom the other 


25 


primary PCR product. The strands having the overlap at 
their 3' ends can act as primers for one another, i.e., 
from an internally primed duplex, and be extended by the 
polymerase to form the full length final product. The 
final product is then amplified by the set of outside 
primers, which act as a third PCR pair when the inside 
15 primers have been exhausted, to form a secondary PCR 
product. Typically the molar ratio of outside primers to 
inside primers is such that the inside primers are 
effectively exhausted within about. 2 to about 12, 
preferably about 5, 6 or 7 thermocycles . 

The PCR buffer also contains the deoxyribonucleotide 
triphosphates dATP, dCTP, dGTP, and a polymerase, typic- 
ally thermostable, all in adequate amounts for primer 
extension (polynucleotide synthesis) reaction. The 
resulting solution (PCR admixture) is heated to about 90 *C 
- 100* C for about 1 to 10 minutes, preferably from 1 to 4 
minutes. After this heating period the solution is 
allowed to cool to 54 *C, which is preferably for primer 
hybridization. The synthesis reaction may occur at from 
room temperature up to a temperature above which the poly- 
30 merase (inducing agent) no longer functions efficiently. 
Thus, for example, if DNA polymerase is used as inducing 
agent, the temperature is generally no greater than about 
40 -C, An exemplary PCR buffer comprises the following: 
so mM kci; 10 mM Tris-HCi; pH 8.3; 1.5 mM Mgci 2 ; 0.001% 
35 (wt/vol) gelatin, 200 fM dATP; 200 MM dTTP; 200 MM dCTP; 
200 MM dGTP; and 2.5 units acoustlcus DNA poly . 
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m erase I (U.S. Patent No. 4,889,818) per 100 microliters 

of buffer. ■ 

After producing operatively linked y r and V r coding 
DNA homologs for a plurality of different V - and V - 
5 coding genes within the repertoires, the aicxstronic DNA 
B olecules are typically further amplified. Whil. the 
dicistronic DNA molecules can he amplified by classic 

techniques such as i ncorp^ation-into- - an-eutonomously 

replicating vector, it is preferred to first amplify the 

10 molecules by subjecting them to a polymerase chain reac- 
tion (PGR) prior to inserting them into a vector. In 
fact, in preferred strategies, the first and second PGR 
reactions are performed in the same admixture that ±s 
subject to a multiplicity of PGR thermocycles where the 

15 outside primers are in molar excess. Preferably the 
number of PGR thermocycles is at least n + 5, wherein n xs 
the number of PGR thermocycles necessary to decrease by a 
factor of 10, and preferably exhaust, the number of inside 
primers by consumption in the formation of inside primer- 

20 crimed products. . 

A diverse library of dicistronic DNA molecules having 
upstream and downstream cistrons can also be produced by 
combining, in a PGR buffer, double stranded V H and J L 
repertoires, V H and V L outside primers, and an inside 

25 primer having a 3-teminal priming portion, a cistronic 
bridge coding portion, and a 5-terminal inside primer- 
template (primer-coding) portion. The 3-terminal priming 
portion has a nucleotide base sequence complementary to a 
portion of the primer extension product of one of the 

30 outside primers. The 5-terminal primer-template portion 
has a nucleotide base sequence homologous (identxcal) to 
a protion of the primer extension product of the other of 
the outside primers. That is, the linking primer has 
terminal sequences homologous to sequences in both reper- 

35 toires. Th cistronic bridge coding porti n codes f r, 
either directly or through c mplem ntarily, at least one 
stop cod n in the same reading frame as the upstream 
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4-v.o pvnression of the downstream 
cistron and sequences for the expression 

cistron. ^. 

The dicistronic DNA molecules containing operatively 
linked V H - and V L -coding DNA homologs produced by PCR 
amplification are typically in double-stranded in form and 
ma y have contiguous or adjacent to each of their termini 
a nucleotide sequence defining an endonuclease restriction 
_site._Digestion_of_the_dicis^^ 


site. uige&uxuii _ w*. — — - 7 — 

restriction sites at or. near their temini with one or more 
0 appropriate endonucleases results in the production of DNA 
' molecules having cohesive termini of predetermined 
specificity. 

When individual PGR admixtures contain diverse gene 
repertoires the present invention produces many non- 
L5 naturally occurring antibodies, i.e., combinations of V H 
and V. in a heterodimer. To take advantage of the 
mammalian immune system's capacity to select V H and V L 
combinations, the present invention also contemplates 
using fusion PGR to operatively link, and thereby recover, 
20 naturally occurring V„ and V L combinations. _ 
in certain preferred embodiments,; a fusion PCR method 
is performed on repertoires comprising a plurality of 
substantially isolated cells containing genes coding for 
a heterodimeric receptor. For example, a plurality of PCR 
25 admixtures is formed, each of which contains (i) a sample 
of substantially isolated B lymphocytes from a mammal pro- 
ducing antibody molecules against a .preselected antigen, 
rii) a PCR buffer, and (ill) either the previously 
described V H and V L PCR primer pairs or the set of outside 

30 V H and V L PCR primers, in combination with the linking 
primer(s) , also as previously described. The plurality of 
PCR admixtures is then subjected to a multiplicity of PCR 
thermocycles as described herein. 

By "substantially is lat d" is meant a sample 

35 containing less than about 100 target cells, such as B 
lynphocytes, T cells, and the like. In preferred embodi- 
ments, the plurality of PCR admixtures contain only ab ut 
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one cell. The cells are typically obtained fro- an indi- 
vidual Whose serum contains antibody molecules 
against the presented antigen. The collected cells are 
typically seeded, usually at densities in the range of 0.5 
6 tTloO cells per unit volume, into a plurality of indi- 
vidual PC* vessels, such as microtiter plate wells and the 
like, usually, the plurality of PCR admixtures is in the 
^rangT of 8.0 to 1200, and ^ferably-is-about-iooo,_ 

separate admixtures. 

^ „~<i 1 e are needed in each PCR 

in Typically, fewer cells are neeaea 

10 xypi ob tained from individuals 

admixture where the cells are craw. 

expressing a high serum antibody titer against. the pre- 
selected antigen. Fcr example, where B lymphocytes are 
obtained from an individual having a frequency of ^circu- 
X5 lating B cells producing the antibody **~>*~r ° f 
deselected specificity of 1/3000, each of about 800 to 
1200 individual PCR admixtures need only contain about one 
B lymphocyte to result in isolation of the desxred anti- 
body. Where the circulating B cell frequency is in the 
20 range of 1/500,000, a density of about 100 cells per PCR 
20 range oi ±/ , individual PCR 

admixture in each of about 800 xo 

admixtures will be needed before the process will result 
in isolation of the desired antibody. 

in preferred embodiments, the PCR process is used not 
25 only to produce a library of dicistronic DNA molecules, 
but also to induce mutations within the library or to 
create diversity from a single parental clone and thereby 
provide a library ' having a greater heterogeneity as noted 
in section 3a hereinabove. 

30 4. repression 

A. pTl "*T1 """or .V. ™" Homologs, 

The V H - and/or V L -coding DNA homologs contained within 
the library produced by the above-described meth d can by 
operatives linXed to a vector for amplification and/or 

35 expression. 
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The choice of vector to which a V r and/or V L -coding 
DNA homolog is operatively linked depends directly, as is 
well known in the art, on the functional properties 
desired, e.g., replication or protein expression, and the 
host cell to be transformed, these being limitations 
inherent in the art of constructing recombinant DNA 
molecules. In preferred embodiments, the vector utilized 
includes^a-procarYotic^replicon i.e., a DNA sequence 
having the ability to direct autonomous replication and 
maintenance of the recombinant DNA molecule extra chromo- 
somally in a procaryotic host cell, such as a bacterial 
host cell, transformed therewith. Such replicons are well 
known in the art. In addition, those embodiments that 
include a procaryotic replicon also include a gene whose 
expression confers a selective advantage, such as drug 
resistance, to a bacterial host transformed therewith. 
Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin or tetracycline. 

Those vectors that include a procaryotic replicon can 
also include a procaryotic promoter capable of directing 
the expression (transcription and translation) of the V H - 
and/or V L -coding homologs in a bacterial host cell, such 
as e. coli transformed therewith. A promoter is an 
expression control element formed by a DNA sequence that 
permits binding of RNA polymerase and transcription to 
occur. Promoter sequences compatible with bacterial hosts 
are typically provided in plasmid vectors containing 
convenience restriction sites for insertion of a DNA 
segment of the present invention. Typical of such vector 
plasmids are pUC8, pUC9, P BR322, and pBR329 available from 
BioRad Laboratories, (Richmond, CA) and pPL and pKK223 
available from Pharmacia, (Piscataway, NJ) . 

Promoters, contain two highly conserved regions, one 
located about 10 bp (-10 region on Priberrow box) and the 
other about 35 bp (-35 region) upstream from the p int at 
which transcription starts. Thes two regions typically 
determine promoter strength. In addition, the number of 
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construction and use of retroviral vectors has been 
described by Sorge et al., Hoi CpI . Biol,, 41730-1737 
1984). 

A variety of methods have been developed to opera- 
tive^ link DNA to vectors via complementary cohesive 
termini. For instance, complementary cohesive termini can 
be engineered into the V„- and/or V L -coding DNA homologs 
-during the-primer-extensi6n-reaction_b y use of an appro- 


10 


25 


30 


as 


priately designed polynucleotide synthesis primer, 
previously discussed. The vector, and DNA homolog if 
necessary, is cleaved with a restriction endonuclease to 
produce termini complementary to those of the DNA homolog. 
The complementary cohesive termini of the vector and the 
DNA homolog are then operatively linked (ligated) to 
15 produce a unitary double stranded DNA molecule. 

in preferred embodiments, the V„-coding and V L -coding 
DNA homologs of diverse libraries are randomly combined in 
vitro for polycistronic expression from individual 
vectors. That is, a diverse population of double stranded 
20 DNA expression vectors is produced wherein each vector 
expresses, under the control of a single promoter, one V H - 
coding DNA homolog and one V L -coding DNA homolog, the 
diversity of the population being the result of different 
V- and V t -coding DNA homolog combinations. 

" Random combination in vitro can be accomplished using 
two expression vectors distinguished from one another by 
the location on each of a restriction site common to both. 
Preferably the vectors are linear double stranded DNA, 
such as a Lambda Zap derived vector as described herein, 
in the first vector, the site is located between a promo- 
ter and a polylinker, i.e., 5' terminal (upstream relative 
to the direction of expression) to the polylinker by 3' 
terminal (downstream relative to the direction of expres- 
sion) . in the second vector, the polylinker is located 
between a promoter and the restriction site, i.e., the 
restriction site is located 3 ' terminal to th polylinker, 
and polylinker is located 3 • terminal t the prom ter. 


35 
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in preferred embodiments , each of the vectors defines 
in preierc ri bosome binding and a 

o nucleotide sequence coding for a rxDos«« 
a nucleotide s «a _ locate d between the promoter 

leader, the sequence being ^ d terBli nal) from the 

, nn1v1 inker, but downstream (3 
and the polylinker, o bet ween the promo- 

te chared restriction site if that site is « 

«r "1 Winker. Also preferred are vectors containing 
rsCccccn downstream from the polylin*er. but upstream 
— — a s wy w _j«.,> j-f that site is aown- 

,^ 0 _ -nv shared restriction-site-if— t na^ » 

TfrcB the polylinfcer. The ««V^« 
l0 vector can also define a ^ ^ 

peptide tec. the tac eeouenc^i . y ^ 

stream from the polylinker but upstream 
codon that may be present. vcctors contain 

in preferred embodiments, the vector 
L5 selectable makers *uch that the presence of a P°rtion-f 
15 ^ vector, i.e.aparticuur^aar^^^ 
for or selected against. Typical selectable markers are 
w !u known to those skilled in the art. Examples* such 
lasers are antibiotic resistance genes, genetically 
20 Selectable makers, mutation suppressors such as amber 
20 selects selectable markers are 

suppressors and the like. Th ter and/or down- 

typically located upstream of the p £erred 
stream of the second restriction site. in P 
stream « marke r is located upstream of 

events. ~£^£TJ**i*>* «- V~«»» 

stream of the second restriction site on 
CU. the V.-codinc DHA homolocs. This 

^rxer may he the same or ^^J^Z^Z' 
. . ori the v -coding vectors and the V L -coaing 
30 • as when the V„ cooing res triction site the 

randomly combined via the first res 

ran»w*uj.jr and both 

resulting vectors containing both V H ana v t 
«=oieetable markers can be selected. 

Ml ^icaxxy the polylin*.r is a nucleotide sequence 
3S that ^ ines on or more, preferably at least two 
restriction sites, each unigu to th vector, i.e., if it 
ZT£*r~ v ct r, it is not on the second vector. 
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The polylinker restriction sit s are oriented to permit 
ligation of V„- or V L -coding DNA homologs into the vector 
in same reading frame as any leader, tag or stop codon 
sequence present. 
5 Random combination is accomplished by ligating V H - 

coding DNA homologs into the first vector, typically at a 
restriction site or sites within the polylinker. 
Similarl y, V r coding DNA homologs are ligated into the 
second vector, thereby creating two diverse populations of 
10 expression vectors. It does not matter which type of DNA 
homolog, i.e., V H or V L , is ligated to which vector, but it 
is preferred, for example, that all V„-coding DNA homologs 
are ligated to either the first of second vector, and all 
of the V t -coding DNA homologs are ligated to the other of 
15 the first or second vector. The members of both popula- 
tions are then cleaved with an endonuclease at the shared 
restriction site, typically by digesting both populations 
with same enzyme. The resulting product is two diverse 
populations of restriction fragments where the members of 
20 one have cohesive termini complementary to the cohesive 
termini of the members of the other. The restriction 
fragments of the two populations are randomly ligated to 
one another, - i.e., a random, interpopulation ligation is 
performed, to produce a diverse population of vectors each 
25 having a V M -coding and V L -codihg DNA homolog located in the 
same reading frame and under the control of second 
vector's promoter. Of course, subsequent recombinations 
can be effected through cleavage at the shared restriction 
site, which is typically reformed upon ligation of members 
30 from the two populations, followed by subsequent 
religations . 

The resulting construct is then introduced into an 
appropriate host to provide amplification and/ or expres- 
sion of the V H - and/or v L -coding DNA homologs, either 
35 separately or in combination. When coexpressed within the 
same organism, eith r n the same or the difference 
vc trs, a functionally active Fv is produced. When the 
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V H and V L polypeptides are expressed in different organ 
iLns, the respective polypeptides are isolated and then 
combined in an appropriate medium to form a Fv. Cellular 
h osts into which a V H - and/or V L -coding DNA homolog- 
containing construct has been introduced are referred to 
herein as having been "transformed., or as trans formants . 

The host cell can be either procaryotic or eucary- 
otic. Bacterial cells are preferred^procaryotic host 


15 


20 


cells and typically are a strain of E^coU such as, for 
10 example, the coli strain DH5 available from Bethesda 
Research Laboratories, Inc., Bethesda, MD. Preferred 
eucaryotic host cells include yeast and mammalian cells, 
preferably vertebrate cells such as those from a mouse, 
rat. monkey or human cell line. ^ w 

Transformation of appropriate cell hosts with a 
recombinant. DNA molecule of the present invention is 
accomplished by methods that typically depend on the type 
of vector used. With regard to transformation of procary- 
otic host cells, see, for example, Cohen et al. , 
^^Ws ^ dem v of Scjepce, USA Vol. 69, P. 

2110 (1972); and Haniatis et al., M n lcm^r rioninq. a 
manual . cold Spring Harbor Laboratory, Cold 
spring Harbor, NY (1982). With regard to the transfor- 
ation of vertebrate cells with retroviral vectors 
25 containing rDNAs, see for example, Sorge et al- ^ 
-, wm. Biol. . 4:1730-1737 (1984); Graham et al., Virol,., 
52 56 (1973); and Wigler et al . , rroco.ni ^ ^tiPP^- 
„, .fences . USA, Vol. 76, P. 1373-1376 (1979) . 

30 * The dicistronic DNA molecules produced by the above- 
described method can be operatively linked to a vector for 
amplification and/or expression. 

A variety of methods have been developed to oepra- 
tively link DNA to vect rs via complem netary cohesive 

35 termini. For instance, complementary cohesive termini can 
b ngineered int the dicistronic DNA molecules during 
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the primer extension reaction by use of an appropriately 
designed polynucleotide synthesis primer, as previously 
discussed. The dicistronic DNA molecule, and vector if 
necessary, is cleaved with a restriction endonuclease to 
5 produce termini complementary to htose of the vector. The 
complementary cohesive termini of the vector and the 
dicistronic DNA molecule are then operatively linked 
(ligated) to produce a unitary double stranded DNA 
molecule. 

10 The present method produces a diverse population of 

double stranded DNA expression vectors wherein each vector 
expresses, under the control of a single promoter, one V H - 
coding DNA homolog and one V L -coding DNA homolog, the 
diversity of the populuation being the result of different 
15 V„- and V L -coding DNA homolog combination that occurs 
during the PCR reaction where both outside and both inside 
primers are present in effective amounts. Preferably the 
vectors are linear double stranded DNA, such as a Lambda 
Zap derived vector as described herein. 
20 in preferred embodiments, the vector defines a 

nucleotide sequence coding for a rihosome binding site and 
a leader, the sequence being located downstream from a 
promoter and upstream from a sequence ceding for apoly- 
peptide leader. In preferred embodiments, the vector 
25 contains a selectable marker such that the presence of a 
dicistronic DNA molecule of this invention inserted into 
the vector, can be selected. Typical selectable markers 
are well known to those skilled in the art. Examples of 
such markers are antibiotic resistance genes, genetically 
30 selectable markers, mutation suppressors such as amber 
supppresso'rs and the like. The selectable markers are 
typically located upstream of the promoter. 

The resulting construct is then introduced into an 
appropriate host to provide amplification and/or expres- 
35 sion of the V„- and V L -coding DNA homologs. When 
coexpressed within th same organism, a functionally 
active heterodimeric receptor, such as an F v , is produced* 
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cellular hosts into which a V H - and V t -coding DNA homolog- 
containing construct has been introduced are referred to 
herein as having. been. "transformed" or as "transf ormants" . 

The host cell can be either prokaryotic or 
eukaryotic. Bacterial cells are preferred prokaryotic 
h ost cells for library screening, and typically are a 
strain of IL-Ssli such as, for example, the E^SfiU strain 
nu, available from Bethesda Research l^ oratorles^jnc^ 


" B ethesda, MD. Preferred eukaryotic host cells include 
10 yeast and mammalian cells, preferably vertebrate cells 
such as those from a mouse, rat, monkey or human cell 

^'Transformation of appropriate cell hosts with a 
recombinant DNA molecule of the present invention is 

15 accomplished by methods that typically depend on the type 
of vector used. With regard to transformation of prokary- 
otic host cells, see, for example, Cohen et al., 
^ i^d. sci.. USA, 69:2110 (1972); and Maniatis et 
^ nnT--^-- * Moratory Manual, Cold .Sprxng 

20 Harbor, NY (1982). With regard to the transformation of 
vertebrate cells with retorviral vectors containing rDNAs, 
see for example, Sorge et al., y ol o» U. Biol. ,, 4:1730- 
1737 (1984); Graham et al., yirpJL. , 52:456 (1973); and 
Wigler et al., TT ■ ■ Sel " USA ' 76:1373-1376 

25 (1979). 

5 r - r ~-<™ **r e nf "„ rn— P»lv^id«« 

successfully transformed cells, i.e., cells contain- 
ing a V H - and/or V L -coding DNA homolog or a dicistromic DNA 
molecule operatives linked to a vector, can be identified 

30 by any suitable well known technique for detecting the 
binding of a receptor to a ligand or the presence of a 
polynucleotide coding for the receptor, preferably its 
active site. Preferred scr ening assays are those where 
the binding of ligand by the receptor produc s a detect- 

35 able signal, either directly r indirectly. Such signals 
include, for example, th production of a complex, 
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formation of a catalytic reaction product, the release or 
uptake of energy, and the like. For example, cells from 
a population subjected to transformation with a subject 
rDNA can be cloned to produce monoclonal colonies. Cells 
5 form those colonies can be harvested, lysed and their DNA 
content examined for the presence of the rDNA using a 
method such as that described Southern, J. flol T Bio}. , 
__o.R. g .S03 (1975) or Berent et al., Biotech ,. 3t 208 (1985). ^ 

In addition to directly assaying for the presence of 
10 a V„- and/or V L -coding DNA homolog or a dicistronic DNA 
molecule, successful transformation can be confirmed by 
well known immunological methods, especially when the V H 
and/or V L polypeptides produced contain a preselected 
epitope. For example, samples of cells suspected of being 
15 transformed are assayed for the presence of the 
preselected epitope using an antibody against the epitope. 

6. V M - ftTV VnT- V, -Codi ng Gene Libraries 

According to one aspect, the present invention 
contemplates a gene library, preferably produced by a 
20 primer extension reaction or combination of primer 
extension reactions as described herein, containing at 
least about 10 3 , preferably at least about 10* and more 
preferably at least about 10 5 different V„- and/or VL- 
coding DNA homologs. The homologs are preferably in an 
25 isolated form, that is, substantially free of materials 
such as, for example, primer extension reaction agents 
and/or substrates, genomic DNA segments, and the like. 

In preferred embodiments, a substantial portion of 
the homologs present in the library are operatively linked 
30 to a vector, preferably operatively linked for expression 
to an expression vector. 

Preferably, the homologs are present in a medium 
suitable for in vitro manipulation, such as water, water 
containing buffering salts, and the like. The medium 
35 should be compatible with maintaining the activity of the 
homologs. In addition, the homologs should be present at 
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a concentration sufficient to allow transformation of 
host cell compatible therewith at reasonable frequencies. 

It is further preferred that the homologs be present 
in compatible host cells transformed therewith. 

5 c. Expressi on Vectors 

The present invention also contemplates various 
expression vectors useful in performing, inter alia , the 
methods of the present invention. Each of the expression 
vectors is a novel derivative of Lambda Zap vector. 

10 1. Lambda Zap II 

Lambda Zap II is prepared by replacing the Lambda S 
gene of the vector Lambda Zap with the Lambda S gene from 
the Lambda gtlO vector, as described in Example 6. 


15 


20 


2 . y.ainbda Z ap II V,, 

Lambda Zap II V„ is prepared by inserting the 
synthetic DNA sequences illustrated in Figure 6A into the 
above-described Lambda Zap II vector. The inserted 
nucleotide sequence advantageously provides a ribosome 
binding site (Shine-Dalgarno sequence) to permit proper 
initiation of mRNA translation into protein, and a leader 
sequence to efficiently direct the translated protein to 
the periplasm. The preparation of Lambda Zap II V H is 
described in more detail in Example 9, and its features 
illustrated in Figures 6A and 7. 

25 3. J-ambda Zap TI V L 

Lambda Zap II V t is prepared as described in Example 
12 by inserting into Lambda Zap II the synthetic DNA 
sequence illustrated in Figure 6B. Important features of 
Lambda Zap II V L are illustrated in Figure 8. 
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4 . Lambda Z ap II V L II 

Lambda Zap II V t II is prepared as described in 
Example 11 by inserting into Lambda Zap II the synthetic 
DNA sequence illustrated in Figure 10. 

5 5. HCFLP 

HCFLP is prepared as described in Example 20 by 
inserting a flp sequence containing EcoRI compatible e nds 
into the EcoRI site^of the lambda zap ^Tvj^ectbr. 

6. LCFLP 

10 LCFLP is prepared as described in Example 20 by 

inserting a flp sequence containing EcoRI compatible ends 
into the EcoRI site of the lambda Zap II V L vector. 

7. Lambda ImmunoZAP H 

Lambda ImmunoZAP H is prepared by inserting the 
15 synthetic DNA sequences illustrated in Figure 25A into the 
above-described Lambda Zap II vector. The inserted 
nucleotide sequence advantageously provides a ribosome 
binding site (Shine-Dalgarno sequence) to permit proper 
initiation of mRHA translation into protein, and a leader 
20 sequence to efficiently direct the translated protein to 
the periplasm. The preparation of Lambda ImmunoZAP H is 
described in more detail in Example 28, and its features 
illustrated in Figures [25A] and [26]. 

8. Modified Lambda ImmunoZAP H 

25 Modified Lambda ImmunoZAP H is prepared by inserting 

the modified synthetic DNA sequences illustrated in Figure 
8A into the above-described Lambda ZAP II vector. The 
preparation' of modified Lambda ImmunoZAP H and the details 
of the modifications are described in Example 28B. Its 

30 features are illustrated in Figure [24A] and [24B] . 
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9. f-ambda T ™™»"oZAP L 

Lambda ImmunoZAP L is prepared as described in 
Example 29 by inserting into Lambda ZAP II the synthetic 
DNA sequence illustrated in Figure 6B. Important features 
5 of Lambda ImmunoZAP L are illustrated in Figure 27. 

The above-described vectors are compatible with 
coli hosts, J^, they can express for secretion into the 

periplasm proteins coded, for by genes to which they have 

been operatively linked for expression. 

10 EXSSE T ht following examples are intended to illustrate, 
but not limit, the scope of the invention. 

!. ph^nnfvoe Creation 

. m order to obtain lambda phage clones with a range 

15 of desired phenotypes, a combinatorial library selection 
system was used to generate a diverse collection of 
clones. This approach utilized two starting populations 
of lambda phage clones which can be restriction digested, 
mixed, ligated, and packaged to form a library of clones 

20 containing DNA sequences from each of the two P^<*°" 
of parent phage. The following example outlines the 
method for rapid construction and selection of lambda 
phage clones containing properties from each of the two 
parent phage populations derived from lambda WT (cI857 

25 indl, Sam7) and lambda gtll (SamlOO) . 

Forty micrograms of a population of lambda phage 
derived from wild type lambda (WT) DNA (CI857 Sam7) 
(available from New England Biolabs) was partially 
digested with lambda HindlH as determined by ethidium 

30 bromide staining on 0.8% agarose gels (Maniatis et aJL. , 
-Molecular Cloning,". Cold Spring Harbor Laboratory 
(1982)). Forty micrograms of a second phage population 
d rived from lambda gtll DNA (available f rom stratag ne 
Cloning Systems, San Diego, CA) was-digested to completion 

35 with HindlH. subsequently, this gtll DNA was digested 
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t-o reduce the cloning 

tfith . second enzyme BamHI » orter « re ^ ^ 

*«™ o £ ^^ g r p ^rX amplified 
,1., aiP»). Both „ plia9 ;. P ° P . rel atively high degree of 
iytically. vhich allowed *« Bloro , rim „ f the 

5 mutations in the resultwg DN*. „ , « , 

1M hda - - - in ^ -u»e 

ofj^^s^han^l^accorti^^ 

^^aSon^^^^P^ systeBS , 

10 packaging extract. Gigapa=K £trat 9 

San Diego, CM , as "V***" containe d a mixture of 
The packaged phage library co ^ ^ 

M „y lambda phage ^ stm = tiC ^-. 6<slectlo „ was used to 
.esired construction., ^f^^aylng vigorous 
15 identify those members of «»1 ry ^ ^ ^utls 

srowth on sup* phage li6 rary was plated 

e£ Si., SUESa. dilutions ing syst ems. San 

with E. coli C600 cells <Stratag iso iated 

D iego,oM - -^"^ plagues are result of 
20 lambda places. These slJ>oe 
clonal expansion from a sing" individual 
C6 00 cells are supE. ^^J^ \y «e size of the 
lamb da phage clones The parental «T 

iambda Phage plague on the J^±^ f ^ least ^e 
25 phage do not form places subseq uently categor- 

ol asses of phage were siie . The large 

uea as ^ l 7 n ;^r 0 f v^orous growth on^e 

plaque size was an inai inai cated poor growth. 

ph age la wn, while small Plaque sxzex^ ^ ^ ^ 

30 This demonstrates selection for^ cQUld be 

S gene based on plaque size. Other P 

used for selection. restr iction mapping 

subsequent characterization by res 

and plating on supO » Seated that 

35 suppressing ^J^J^J^' ». 

at least one of the ^ ^ WQ ln ^ lambda WT 
not contain an amber mutation as una 
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(Sam7) or lambda gtll (C5100) parent phage. One of the 
small plaque phage, S2, contained the left arm of lambda 
WT gene and the right arm of lambda gtll containing the 
• samlOO gene. This samioo mutation is known to grow poorly 
5 on supE hosts and is optimal on a supF strain, with no 
growth on a supO host. The remaining of clo " es 

displayed several differ ent phenotypes , dictated ay the 
-^^X ^e^ ity-of^ e-two ritS^ 

g ^71I s 7~g;h^^ that resulte d from the 

10 ^ndo^Tassortment of two mutant DNA f ragments derived from 
j ust one of the parent DNA molecules. THis illustrates 
concept that the two genes that give rise to the 
populations of interest need not be on separate DNA 
molecules at the start of the method. 
15 Due to the phenotypic selection applied following the 

ligation and packaging of the phage library, the large 
diversity of these two populations of phage was not com- 
pletely analyzed. However, the range of clones identified 
with alterm*te_S_gene_j3h^^ 

diversity. The diversity in these two populations of 
iSmbdT phage is believed to be derived from the low level 
of spontaneous mutations which occur through repeated 
rounds of replication required in large scale preparations 
of lambda phage. However, the spontaneous mutations 
occurring within each of these individual phage popula- 
tions could not generate a collec£ ion_of lambda phage 
containing char acteristics of both pare nt populations of 
phage. This combinatorial approa ch^ therefore, provides 
a me chanism in which novel constru cti ons can be generated 
^TZcoress genes from both. parent phage constructions. 

2. Pol YmieleolHrtP- Select *™ Tmmnnocrl obul jn Product Ip p 

The nucleotide sequences encoding the immunoglobulin 
protein CDR's are highly variable. However, th re are 
sev ral regions of conserved s quences that flank the V„ 
domains. For instance, contain substantially conserved 
nucl tide s quences, s quences that will hybridize 
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to the same primer sequence. Th refore, polynucleotide 
synthesis (amplification) primers that hybridize to the 
conserved sequences and . incorporate restriction sites into 
the DNA homolog produced that are suitable for operatively 
5 linking the synthesized DNA fragments to a vector were 
constructed. More specifically, the DNA homologs were 
inserted into lambda Zap II vector (Stratagene Cloning 

System , San Dieg o , CA) at the Xhol and Bc o RI sites. For 

amplification of the V„ domains, the 3 \ primer (primer 67 
10 in Table 7) , was designed to be complementary to the mRNA 
in the J H region. In all cases, the 5« primers (primers 
56-65,, Table 7) were chosen to be complementary to the 
first strand cDNA in the conserved N-terminus region 
(antisense strand) . Initially amplification was performed 
15 with a mixture of 32 primers (primer 56, Table 7) that 
were degenerate at five positions. Hybridoma mRNA could 
be amplified with mixed primers, but initial attempts to 
amplify mRNA from spleen yielded variable results. 
Therefore, several alternatives to amplification using the 
20 mixed 5' primers were compared. 

The first alternative was to construct multiple 
unique primers, eight of which are shown in Table 7, 
corresponding to individual members of the mixed primer 
pool. The individual primers 52-64 of Table 7 were 
25 constructed by incorporating either of the two possible 
nucleotides at three of the five degenerate positions. 

The second alternative was to construct a primer 
containing inosine (primer 65, table 7) at four of the 
variable positions based on the published work of 
Takahashi, et al., ?T™- Natl, ftcad, Sc*- fU t S f ft T V , 
82:1931-1935, (1985) and Ohtsuka et al., J, Pfroj-t Chem, , 
260:2605-2608, (1985) . This primer has the advantage that 
it is not degenerate and, at the same time minimizes the 
negative effects of mismatches at the unconsented posi- 
ti ns as discussed by Martin et al., 1*u t Acids Res ., 
13:8927 (1985) . However, it was not known if the presence 
f inosine nucleotides would result in incorp ration of 


30 


35 


SUBSTITUTE 


SHEET 


WO 91/16427 


PCT/US91/02910 


81 


unwanted sequences in the cloned V„ regions. Therefore, 
inosine was not included at the one position that remains 
in the amplified fragments after the cleavage of the 
restriction sites. As a result, inosine was not in the 

cloned insert. 

Additional, V H amplification primers including the 
unique 3' primer were designed to be complementary to a 
portion of-the-first constant-region-domain-of--the_gamma_ 


1 heavy chain mRNA (Primers 70 and 71, Table 7). These 
10 primers will produce DNA homologs containing polynucleo- 
tides coding for amino acids from the V H and the first 
constant region domains of the heavy chain. These DNA 
homologs can therefore be used to produce Fab fragments 

rather than an F v . 

1S As a control for amplification from spleen or 

hybridoma mRNA, a set of primers hybridizing to a highly 
conserved region within the constant region igG, heavy 
chain gene were constructed. The 5' primer (primer 66, 
Table 7) is complementary to the cDNA in the C H 2 region 

20 whereas the 3 • primer (primer 68, Table 7) is complement- 
ary to the mRNA in the C„3 region. It is believed that no 
mismatches Were present between these primers and their 
templates. 

The nucleotide sequences encoding the V L CDRs are 
25 highly variable. However, there are several regions of 
conserved sequences that flank the V L CDR domains including 
the J L , V L framework regions and V L leader/promoter. 
Therefore, amplification primers that hybridize to the 
conserved sequences and incorporate restriction sites that 
30 allowing cloning the amplified fragments, into the 
pBluescript SK-vector cut with Nco I and Spel were con- 
structed. For amplification of the V L CDR domains, the 3» 
primer (primer 69 in Table 7), was designed to be comple- 
mentary to the mRNA in the J L regions. The 5- primer 
35 (primer 70, Table 7) was chosen to be c mplementary to the 
first strand cDNA in the conserved N-terminus r gion 
(antisense strand) 
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A second set of amplification primers for amplifica- 
tion of the V L CDR domains the 5« primers (primers 73-80 in 
Table 8) were designed to be complementary to the first 
strand cDNA in the conserved N-terminus region. These 
primers also introduced a Sac I restriction endonuclease 
site to allow the FLDNA homolog to be cloned into the V L II- 
cxpression vector. The 3- V L amplification primer (primer 
_ 83 _ ln _ Table -8 : )-was-designed-to_be_c^ ^ 


10 


iorna in the J t regions and to introduce the Xbal restric- 
tion endonuclease site required to insert the V L DNA homolog 
into the v L II-expression vector (Figure 8) . 

Additional 3' V t amplification primers were designed 
to hybridize to the constant region of either kappa or 
lambda jnRNA (primers 82 and 83 in Table 8) . These primers 
15 allow a DNA homolog to be produced containing polynucleo- 
tide sequences coding for constant region amino acids of 
either kappa or lambda chain. These primers make it 
possible to produce an Fab fragment rather than an F v . 

The primers used for amplification of kappa light 
20 chain sequences for construction of Fabs are shown at 
least in Table 8. Amplification with these primers was 
performed in 5 separate reactions, each containing one of 
the 5' primers (primers 75-78, and 84) and one of the 3- 
primer (primer 81) has been used to construct F v fragments. 
25 The 5« primers contain a Sac I restriction site and the 3' 
primers contain a Xbal restriction site. 

The primers used for amplification of heavy chain Fd 
fragments for construction of Fabs are shown at least in 
Table 7. Amplification was performed in eight separate 
30 reactions, each containing one of the 5« primers (primers 
57-64) and one of the 3- primers (primer 70). The remain- 
ing 5' primers that have been used for amplification in a 
single reaction are either a degenerate primer (primer 56) 
or a primer that incorporates inosine at four degenerate 
35 positions (primer 66, Table 7, and primers 89 and 90, 
Table 8). The remaining 3' primer (primer 86, Table 8) 
has been used to c nstruct F v fragments. Many of th 5' 
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and 3' primers include 
primers incorporate a Xho I site, and 

a spel restriction site. , m - imaA to amplify human 

v t amplification pnmers i an4 ^pp* 

Ught chain variable regions- of both the 

. isotypes are also ^^^ieotiaes used herein 
M1 primers and „***«p^ fron 

and shovm on Tables ' 11 or ^thesUed on 

- R esearch-Genetics_injuntsvme^^ 

aB Applied Biosystems DHA synthesizer. 


an Applied Biosysrau* ^ -*» 
10 the manufacturer's instructions. 
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35 


r^^^rj ft v„ coti m »«n>rteirP Enriched }n 
vjtc Bin H ^ ng f Proteins 

Fluorescein isothiocyanate (FITC) was selected as a 
ligand for receptor binding. It was further decided to 
enrich by immunization the immunological gene repertoire, 
i.e., V H - and V L -coding gene repertoires, for genes coding 
for antl-FITC receptors. This was accomplished by linking 
FITC-to-keyhole-limpet-hemocyanin-(KIfl)- usin g the tech- 


niaues described in AnHhodl^ ft ^oratory Manual , Harlow 
and Lowe, eds., Cold Spring Harbor, New York, (1988). 
Briefly, 10.0 milligrams (mg) of keyhole limpet hemocyanin 
and 0.5 mg of FITC were added to 1 ml of buffer containing 
0 l M sodium carbonate at pH 9.6 and stirred for 18 to 24 
hours at 4 degrees C (4C) . The unbound FITC was removed 
by gel filtration through Sephadex G-25. 

The KLH-FITC conjugate was prepared for injunction 
into mice by adding 100 W of the conjugate to 250 Ml of 
phosphate buffered saline (PBS). An equal volume of com- 
plete Freund's adjuvant was added and the entire solution. 
20 was emulsified for 5 minutes. A 129 G 1X , mouse was 
injected with 300 Ml of the emulsion. Injections were 
aiven subcutaneously at several sites using a 21 gauge 
needle. A second immunization with ; KLH-FITC was given two 
week later. This injection was prepared as follows: 
fifty /xg of KLH-FITC were diluted in 250 ML of PBS and an 
egual volume of alum was admixed to the KLH-FITC solution. 
The mouse was injected intraperitoneal^ with 500 Ml of 
the solution using a 23 gauge needle. One month later the 
mi ce were given a final injection of 50 _g of the KLH- 
FITC conjugate diluted to 200 _L in PBS. This injection 
was given intravenously in the lateral tail vein using a 
30 gauge needle. Five days after this final injection the 
B ice were sacrificed and total cellular RNA was isolated 

from th ir spleens. 

Hybrid ma PCP 8D11 producing an antibody immuno- 
specif ic for phosphonate ester was cultured in DMEM media 
(Gibco Laboratories, Grand Island, New York) containing 10 
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percent fetal calf serum supplemented with penicillin and 
streptomycin. About 5 x 10 8 hybridoma cells were harvested 
and washed twice in phosphate buffered saline. Total 
cellular RNA was prepared from these isolated hybridoma 
5 cells. 

4 . Preparation Of ft V n -rn«H ng Gene Repertoire 

Total cellular RNA was prepared from the spleen of a 


15 


20 


single mouse immunized with KLH-FITC as described in 
Example 3 using the RNA preparation methods described by 
10 Chomczynski et al., finnl Bjochem., 162:156-159 (1987) 
using the manufacturer's instructions and the RNA isola- 
tion kit produced by Stratagene cloning systems, La Jolla, 
CA. Briefly, immediately after removing the spleen from 
the immunized mouse, the tissue was homogenized in 10 ml 
of a denaturing solution containing 4.0 M guanine Isothio- 
cyanate, 0.25 M sodium citrate at pH 7.0, and 0.1 M 2- 
mercaptoethanol using a glass homogenizer. One ml of 
sodium acetate at a concentration of 2 M at pH 4.0 was 
admixed with the homogenized spleen. One ml of phenol 
that had been previously saturated with H 2 0 was also 
admixed to the denaturing solution containing • the homo- 
genized spleen. Two ml of a chloroform: isoamyl alcohol 
(24:1 v/v) mixture was added to this homogenate. The 
homogenate was mixed vigorously for ten seconds and 
25 maintained on ice for 15 minutes. The homogenate was then 
transferred to a thick-walled 50 ml polypropylene centri- 
fuge tube (Fisher scientific Company, Pittsburgh, PA). 
The solution was centrifuged at 10,000 x g for 20 minutes 
at 4*C. The upper RNA-containing aqueous layer was 
transferred to a fresh 50 ml polypropylene centrifuge tube 
and mixed with an equal volume of isopropyl alcohol. This 
solution was maintained at -20* C for at least one hour to 
precipitate the RNA. The solution containing the precipi- 
tated RNA was centrifuged at 10,000 x g for twenty minutes 
at 4*C. The pelleted total cellular RNA was collected and 
dissolv d in 3 ml of th denaturing solution described 
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above. Three ml of isopropyl alcohol was added to the 
resuspended total cellular RNA and vigorously mixed. This 
solution was maintained at -20'C for at least 1 hour to 
precipitate the RNA. The solution containing the precipi- 
5 tated RNA was centrifuged at 10,000 x g for ten minutes at 
4-C The pelleted RNA was washed once with a solution 
containing 75% ethanol. The pelleted RNA was dried under 
— vacuum-for-15-minutes_and_then_resus pended to dimethyl 

pyrocarbonate (DEPC) treated up <DEPC-H 2 0) . 
10 Messenger RNA (mRNA) enriched for sequences contain- 

ing long poly A tracts was prepared from the total 
cellular RNA using methods described in N^lar Clwim 
^-w^.^, Manual. Haniatias et al., eds. Cold ^Spring 
Harbor Laboratory, New York, (1982) . Briefly, one half of 
15 the total RNA isolated from a single immunized Bouse 
spleen prepared as described above was resuspended in one 
m l of DEPC-I^O and maintained at 65'C for five minutes, 
one ml of 2x high salt loading buffer consisting of 100 mM 
Tris-HCl, 1M sodium chloride, 2.0 mM disodium ethylene 
20 diamine tetraacetic acid (EDTA) at: P H 7.5, and 0 2% sodium 
dodecyl sulfate (SDS) was added to the resuspended RNA and 
the mixture allowed to cool to room temperature. The 
mixture was then applied to an oligo-dT (Collaborative 
Research Type 2 or Type 3) column that was previously 
25 prepared by washing the oligo-dT with a solution contain- 
L 0.1 M sodium hydroxide and 5 mM EDTA and then 
equilibrating the column with DEPC-H 2 0. The eluate was 
collected in a sterile polypropylene tube and reapplied to 
the same column after heating the eluate for 5 minutes at 
30 65-C The oligo dT. column was then washed with 2 ml of 
high salt loading buffer consisting of 50 mM Tris-HCl at 
pH % 5 500 mM sodium chloride, 1 mM EDTA at pH 7.5 and 
0 1% SDS . The oligo dT column was then washed with 2 ml 
oHX medium salt buffer consisting of 50 mM Tris-HCl at 
p H 7 5 100 mM sodium chloride 1 mM EDTA and 0.1% SDS. 
The m s's ng r RNA was eluted from the oligo dT column with 
xml of buffer consisting of 10 mM Tris-HCl. at P H 7.5, 1 *M 
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EDTA at P H 7.5 and 0.05% SDS. The messeng r RNA was puri- 
fied by extracting this solution with phenol/chloroform 
followed by a single extraction with 100% chloroform. The 
messenger RNA was concentrated by ethanol precipitation 
and resuspended in DEPC- 1^0 . 

The messenger RNA isolated by the above process 
contains a plurality of different V„ coding polynucleo- 
tides, i.e., greater than about 10 4 different V„-coding 


genes . 

10 s. pr oration nf & single V.. Coding Polynucleotide 

Polynucleotides coding for a single V H were isolated 
according to Example 4 except total cellular RNA was 
extracted from monoclonal hybridoma cells prepared in. 
Example 3. The polynucleotides isolated in this manner 

15 code for a single V„. 

6. pNA Homoloo Prep aration 

in preparation for PCR amplification, mRNA prepared 
according to the above examples was used as a template for 
cDNA synthesis by a primer extension reaction. In a 
20 typical 50 ul transcription reaction, 5-10 ug of spleen or 
hybridoma mRNA in water was first hybridized (annealed) 
with 500 ng (50.0 pmol) of the 3' V„ primer (primer 67, 
Table 7), at 65* C for five minutes. Subsequently, the 
mixture was adjusted to 1.5 mM dATP, dCTP, dGTP and dTTP, 
25 40 mM Tris-HCi at PH 8*0, 8 mM MgCl 2 , 50 mM NaCl, and 2 mM 
spermidine. Moloney-Murine Leukemia virus Reverse 
transcriptase (Stratagene Cloning Systems) , 26 units, was 
added and the solution was maintained for 1 hours at 37 «C. 
PCR amplification was performed in a 100 ul reaction 
30 containing the products of the reverse transcription 
reaction (approximately 5 ug of the cDNA/RNA hybrid) , 300 
ng of 3' v„ primer (primer 67 of Table 7) , 300 ng each of 
the 5' V R priaers (primer 57-65 of Table 7) 200 mM of a 
mixture of dNTP's, 50 mM KCl, 10 mM Tris-HCl pH 8.3, 15 mM 
35 MgCl 2 , 0.1% gelatin and 2 units f Taq DNA polymerase. The 
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reaction mixture was overlaid with mineral oil and sub- 
jected to 40 cycles of amplification. Each amplification 
cycle involved denaturation at 92'C for 1 minute, anneal- 
ing at 52 -C for 2 minutes and polynucleotide synthesis by 
5 Primer extension (elongation) at 72'C for 1.5 minutes. 
The amplified V„-coding DNA homolog containing samples were 
extracted twice with phen ol/chloroform, once with chloro- 
form , etha^oT^e^ipTtaTO^d-wer^store 
roM Tris-HCl, <pH, 7.5) and 1 mM EDTA. 

10 Using unique 5' primers (57-64, Table 7), efficient 

V H -coding DNA homolog synthesis and amplification from the 
spleen mRNA was achieved as shown in Figure 3, lanes R17- 
K24. The amplified cDNA (V„-coding DNA homolog) is seen as 
a major band of the expected size (360 bp) . The intensi- 

15 ties of the amplified V H -coding polynucleotide fragment in 
each reaction appear to be similar, indicating that all of 
these primers are about equally efficient in initiating 
amplification. The yield and quality of the amplification 
with these primers was reproducible . 

20. The primer containing inosine also synthesized ampli- 

fied V H -cbding DNA homologs from spleen mRNA reproducibly, 
leading to the production of the expected sized fragment, 
of an intensity similar to that of the other amplified 
cDNAs (Figure 4, lane R16) . This result indicated that 

25 the presence of inosine also permits efficient DNA homolog 
synthesis and amplification. Clearly indicating how 
useful such primers are in generating a plurality of v„- 
coding DNA homologs. Amplification products obtained from 
the constant region primers (primers 66 and 68, Table 7) 

30 were more intense indicating that amplification was more 
efficient, possibly because of a higher degree of homology 
between the template and primers (Figure 4, Lang R9) . 
Based on these results, a V r coding gene library was 
constructed from the products of eight amplifications, 

35 each performed with a different 5' primer. Equal portions 
of the products from each primer extension reacti n were 
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nixed and the mixed product was then us d to generate a 
library of V H -coding DNA homolog-containing vectors. 

DNA homologs of the V L were prepared from the purified 
mRNA prepared as described above. In preparation for PCR 
amplification, mRNA prepared according to the above 
examples was used as a template for cDNA synthesis. In a 
typical 50 ul transcription reaction, 5-10 ug of spleen or 
hybridoma_mRNA_in^water_was_f i rst annealed with 3 00_Jig 


(50.0 pmol) of the 3 • V L primer (primer 69, Table 7) , at 
L0 65 'C for five minutes. Subsequently, the mixture was 
adjusted to 1.5 mM dATP, dCTP, dGTP, and dTTP, 40 mM Tris- 
HCl. at pH 8.0, 8 mM MgCl 2 , 50 mM NaCl,. and 2 mM spermidine. 
Moloney-Murine Leukemia virus reverse transcriptase 
(Stratagene Cloning Systems) , 26 units, was added and the 
15 solution was maintained for 1 hour at 37 'C. The PCR 
amplification was performed in a 100 ul reaction contain- 
ing approximately 5 ug of the cDNA/RNA hybrid produced as 
described above, 300 ng of the 3- V t primer (primer 69 of 
Table 7), 300 ng of the 5 • V L primer (primer 70 of Table 
20 7) , 200 mM of a mixture of dNTP's, 50 mM KC1, 10 mM Tris- 
HC1 P H 8.3, 15 mM MgCl 2 , 0.1% gelatin and 2 units of Taq 
DNA polymerase. The reaction mixture was overlaid with 
mineral oil and subjected to 40 cycles of amplification. 
Each amplification cycle involved denaturation at 92 -C for 
25 i minute, annealing at 52'C for 2 minutes and elongation 
at 72 'C for 1.5 minutes. The amplified samples were 
extracted twice with phenol/chloroform, once with chloro- 
form, ethanol precipitated and were stored at 70'C in 10 
mM Tris-HCl at pH 7.5 and 1 mM EDTA. 

30 7. Jnsgytinq W HomoioaB Tnt.o Vectors 

in preparation for cloning a library enriched in V, 
sequences, PCR amplified products (2.5 mg/30 ul of 150 mM 
NaCl, 8 mM Tris-HCl (pH 7.5), 6 MM MgSO,, 1 mM DTT, 200 
mg/ml bovine serum albumin (BSA) at 37 -C w re digested 

35 with restriction enzymes Xho I (125 units) and Ec R I (10 
U) and purified on a 1% agarose gel. In cl ning experi- 
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10 


15 


20 


25 


30 


ro ents which reguir d a fixture of the products of the 
amplification reactions, equal volumes (50 ul, 1-10 ug 
concentration) of each reaction mixture were combined 
after amplification but before restriction digestion. 
After gel electrophoresis of the digested PCR amplified 
spleen mRNA, the region of the gel containing DNA frag- 
ments of approximately 350 bps was excised, electroeluted 
-intWiiialysis-^^ 


35 


pended in 10 mM Tris-HCl P H 7.5 and 1 mM EDTA to a final 
concentration of 10 ng/ul. Equimolar amounts of the 
insert were then ligated overnight at 5'C to 1 ug of 
Lambda Zap™ II vector (Stratagene Cloning Systems, La 
jolla, CA) previously cut by EcoR I and Xho I. A portion 
of the ligation mixture (1 ul) was packaged for 2 hours at 
room temperature using Gigapack Gold packaging extract 
(Stratagene Cloning Systems, La Jolla, CA) , and the pack- 
aged material was plated on ILl-blue host cells. The 
Library was determined to consist of 2 x 10 V„ homologs 
with less than 30% noh-recombinaht background. 

The vector used above, Lambda Zap II is a derivative 
of the original Lambda zap (ATCC # 40,298) that maintains 
all of the characteristics of the original Lambda Zap 
including 6 unique cloning sites, fusion protein expres- 
sion, and the ability to rapidly excise the insert in the 
form of a phagemid (Bluescript SK-) , but lacks the SAM 100 
mutation, allowing growth on many Non-Sup F strains, 
including XLl-Blue. The Lambda Zap II was constructed as 
described in Short et al., Nnneic ficjds Res. , 16:7583- 
7600, (1988), by replacing the Lambda S gene contained in 
a 4254 base pair (bp) DNA fragment produced by digesting 
Lambda Zap with the restriction enzyme Ncol. This 4254 bp 
DNA fragment was replaced with the 4254 bp DNA fragment 
containing the Lambda S gene isolated from Lambda gtlO 
(ATCC # 40,179) after digesting the vector with the 
rstrictin enzyme Nc I. Th 4254 bp DNA fragment 
isolated fr m lambda gtio was ligated into the original 
Lambda Zap vector using T4 DNA ligase and standard proto- 


SUBSTrTUTE 


SHEET 


WO 91/16427 


PCT/US91/02910 


94 


cols for such procedures described in ^rrPTit Protocols in 
r ~^^ Moleav. Ausubel et al. ( eds., John Wxley and 

sons, New York, (1987). 

in preparation of cloning a library enriched in V L 
sequences, 2 ug of PGR amplified products (2.5 mg/30 ul. of 
150 mM KaCl, 8 mM Tris-HCl ( P H 7.5), 6 mM MgSo,, 1 mM DTT, 
200 mg/ml BSA) were digested with restriction enzymes Nco 
-I-po unites)-and-S P e^I-(45-units)_at 3 7'C for 2 hour s- 


The digested PGR amplified products were purified on 1% 
10 agarose gel using standard electroelution technique des- 
cribed in r— * ^ oratory Manual, Maniatxs 
et al., eds.. Cold Spring Harbor,. New York, (1982). 
Briefly, after gel electroelution of the digested pgr 
amplified product the region of the gel containing the V - 
15 coding DNA fragment of the appropriate size was excised, 
electroelution into a dialysis membrane, ethanol precxpi- 
tated and resuspended at a final concentration of 10 ng 
per ml in a solution containing 10 mM Tris-HCL at pH 7.5 

and 1 mM EDTA. , , 

20 An equal molar amount of DNA representing a plurality 

of different V L -coding DNA homologs was ligated to a 
pBluescript 8K- phagemid vector that had been previously 
cut with' Nco I and Spe I. A portion of the ligation mxx- 
ture was transformed using the manufacturer's instructions 
25 into Epicuian Coli XLl-Blue competent cells (Stratagene 
Cloning Systems, La Jolla, CA) . The transformant library 
vas determined to consist of 1.2 x 10 3 colony formxng 
units/ug of V L homologs with less than 3% non-recombinant 
background. 

30 8 c a rcn H. g of ™-~™« thP v -coding cDNft Library 

To analyze the Lambda Zap II phage clones, the clones 
were excised from Lambda Zap into plasmids according to 
the manufacture • s instructions (Stratagene Cloning System, 
La Jolla, CA) . Briefly, phag plaques were cored from the 
35 agar plates and transferred to sterile microfuge tubes 
containing 500 ul a buffer containing 50 mM Tris-HCL at pH 
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10 


15 


7 . 5 , 100 »M Macl, 10 mM,Mg S o t , and 0.01% gelatin and 20 uL 

°' nr^icns. 200 ul of the phage stocK, ,00 ul of 
XH-Blue ceUs (»* - LOO and 1 ul of MM 
(1 x 10" pfu/ml> were incubated at 37 -c for 15 minutes 
T»e excised plasmids were infected into XU-Blue cells and 
rtatT ontc V Plate, containing ampiciUin. Double 
d anced DMA was. pr epared~from-th.-ph.gemid 

according to the methods described by Hol»es et .1 . . 
^_ Ei2Sb e^. 114:193. (1981). Clones were first 
keened for DNA inserts by restriction digests with 
7£Z Pvu XX or Bgl X and clones containing the putative 
V insert were sequenced using reverse transcrxptase 
Lording to the general method described by Sanger et 
^ 9 „ rt1 C .<.. OS A. 74 = 5463-5467. (X977, end 

specific modifications ; of this method provided in the 
^nufacturer-s instruction in the reverse transcript- 
ase »s-dATP sequencing Kit from Stratagene Cloning 
Systems, La Jolla, CA. 

^ -. r-^^.ic of Th» nonel Y „ P^rtoire _ 
The amplified products which had been digested with 
Xfco ! and EcoK I and cloned into I*mbda zap, resulte in 
a^cDHfi library with *.o x lo' pfu's. In order tc . confirm 
that the library consisted of a diverse population of V 
coding DMA homologs, the N -termin.l «o bases of is 
clones selected at random from the library, were excised 
n s gu need figure 5, . To determine if the Clones were 
V, gtne origin, the cloned sequences were 
torn V„ sequences and V, sequences. The clones exhibited 
STm to .0% homology with sequences of Known beavy 
chain origin and little homology with sequences o : light 
Sain origin when compared with the sequences available in 

tTal", 4tTed . U.S. Dept. of Health and Human sciences, 
'l 987) This demonstrated that the library was enriched 
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the desired V H sequence in preferenc to other 
sequences, such as light chain sequences. 

The diversity of the population was assessed by 
classifying the sequenced clones into predefined subgroups 
(Figure 5, . Mouse V H sequenpes are classified into eleven 
subgroups (Figure 5). Mouse V K sequences are classxf ied 
into eleven subgroups [I (A,B,), II (A,B,C), IH *>*' C >°[ 
v -(A7B)rbased-on-framewor^^^ 


10 


15 


20 


25 


Lot et al., 4th ed., U.S. Dept. of Healt^ and ^ Human 
sciences/ (1987) ; Dildrop, JimnnmwY Today , 5:84, 1984 . 
and Brodeur et al., fiir .T TT™unol,, 14:922, (1984). 
Classification of the sequenced clones demonstrated that 
the cDNA library contained V H sequences of at least 7 
different subgroups. Further, a pairwise comparison of 
the homology between the sequenced clones showed that no 
two sequences w*re identical at all positions, suggestxng 
that the population is diverse to the extent that it xs 
oossible to characterize by sequence analysis. 

Six of the clones <L 36-50, Figure 5) belong to the 
subclass IXI B and had very similar nucleotide sequences. 
This may reflect a preponderance of mRNA derived from one 
or several related variable genes in stimulated spleen, 
but the data does not permit ruling out the possibility of 
a bias in the amplification process. 

10 yRvnresF ^r v^tor construction 

The main criterion used in choosing a vector system 
was the necessity of generating the largest number of Fab 
fragments which could be screened directly. Bacteriophage 
Xambda was selected as the expression vector for three 
reasons. First, in vi£ro. packaging of phage DNA is a 
highly efficient method of reintroducing DNA into host 
cells. Second, it is possible to detect protein expres- 
sion at the level of single phage plaques. Finally, the 
screening of phage libraries typically involve less 
difficulty with nonspecific binding. An alternative, 
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T ~a=e is not lost in the present system becaase of the 

z°rr^:zi, ***** — ^ * ^rr^z 

Z heavy chain, light chain, or rah expressing inserts 

*° the plurality o f V.-codin, *HA ho-oloos in 
'^nT^^^^l^a-vector-vas^onstructed-that^lacea- 
an E^SSll n° B nroDe r reading frame. 
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15 


20 


^^SaH-host-ce^, reading £raTO . 

the V„-codln, BH* bowlegs in the PJJ ^ ^ 

a ribosome binding si^e a© 

1975, provided a leader seguence 
al.r na^SESr periplastic space, 

directing the expressed protein to the pe P 

provided a polynucleotide , seguence ^that 

epitope .epitope ^"^^ J V,-coding OK, 
* 1~ »d Z pol^rieotide coding for the epitope tag. 
h0n lletL b^r^ence containing all of the ahove 

s)mth Ctl ' * „ " !Ltures was constructed by designing 
polynucleotides and features was 

L£e stranded po lynucleotide ^egaents ^ 
that would hybridize to each other and tor. 
branded synthetic « ~ 
individual single-stranded polynucleotia 


leot des ... ». --4., Hll, »»-. »«.« 
» and N^e Kinase* by adding 1 ul of each polynucleotide 
^ l ug/ul) and 20 units of T4 polynucleotide Kinase to a 
(0.1 ug/ux) an« * MgCl 2 
solution containing 70 m Trxs-HCl a t pH 7.6. 1 
5 nM DTT, 10 mM 2-»ercaptoethanol (2ME), 501 g 

! f Llvnucleotides HI end polynucleotides «M, were 
"Jl to Se etve casing reaction soiution together 
with l/io voium ^ thU solu . 

" T tfhl^ ; ToZZ -inutes and allowed to c =1 
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a 500 ml beaker of water. During this time period all 10 
polynucleotides annealed to form the double stranded 
synthetic DNA insert shown in Figure 6A. The individual 
polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert by adding 40 ul of the 
above reaction to a solution containing 50 mM Tris-HCl at 
pH 7.5, 7 mM MgCl 2 , 1 mM DTT, 1 mM adenosine triphosphate 
-(ATP) an* 1" ""its of T4 DNA li gase. This solution was 


15 


20 


maintained at 37 *C for 30 minutes and then the T4 DNA 
10 ligase was inactivated by maintaining the solution at 65 'C 
for 10 minutes. The end polynucleotides were kinased by 
mixing 52 ul of the above reaction, 4 ul of a solution 
containing 10 mM ATP and 5 units of T4 polynucleotide 
kinase. This solution was maintained at 37 *C for 30 
minutes. The completed synthetic DNA insert was ligated 
directly into a lambda Zap II vector that had been previ- 
ously digested with the restriction enzymes Hot I and Xho 
I. The ligation mixture was packaged according to the 
manufacture's instructions using Gigapack II Gold packing 
extract available from Stratagene Cloning Systems, La 
Jolla, CA. The packaged ligation mixture was plated on 
XL1 blue cells (Stratagene Cloning Systems, San Diego, 
CA). Individual lambda Zap II plaques were cored and the 
inserts excised according to the in vivo excision protocol 
25 provided by the manufacturer, Stratagene Cloning Systems, 
La Jolla, CA. This in vivo excision protocol moves the 
cloned insert from the lambda Zap II vector into a plasmid 
vector to allow easy manipulation and sequencing. The 
accuracy of the above cloning steps was confirmed by 
sequencing the. insert using' the Sanger dideoxide method 
described in by Sanger et al., proc. Natl . Acad. Scj USft , 
74:5463-5467, (1977) and using the manufacture's instruc- 
tion in the AMV Reverse Transcriptase ^S-ATP sequencing 
kit from stratagene Cloning Systems, La Jolla, CA. The 
sequence of the resulting V„ expression vector is shown in 
Figure 6A and Figure 7. 
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^Z7i) 5' GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3' 

92 N2 5 • AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 • 

93 N3) 5- GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3' 

M N4) 5- AGGTGAAACTGCTCGAGAATTCTAGACTAGGTTAATAG 3 

(95) N5) 5' TCGACTATTAACTAGTCTAGAATTCTCGAG 3- 

(96) N6) 


5 • TCGACTATXftA«-±«»-«.w*» 

5 . cAGTTTCACCTGGGCCATGGCTGGTTGGG 3« 

; 97r N7) 5«— GAGGGAGTAATAACAATCCAGCGG CTGCCGTAGGCAATAG 3' 

98 H8> 5- GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC 3- 
Ml N9-4) 5. AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 

100) Nil) 5- gAcgttccggactacggttcttaatagaattcg 3- 

101) N12) 5- TCGACGAATTCTATTAAGAACCGTAGTC 3« 

[To 2) N10-5, 5 CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3 

,! v, T-y pressi -n v*r*-*r. Construction 
L5 ?o express the plurality of V L coding Polynucleotides 

in an «■ ^ host cell, a vector was constructed that 
placed the V t coding polynucleotide in the proper ^readi ng 
frame,- provided a ribosome binding site as described by 

20 sequence directing the expressed protein to the piro 
Pernio space and also provided a 

coded for a spacer protein between the V L polynucleotide 
and the polynucleotide coding for the epitope tag. A 
synthetic DNA sequence containing air °* 
25 nucleotides and features was constructed by designing 
si le stranded polynucleotide segments of 20^0 bases 
II would hybridize to each other and for^ «* double 
stranded synthetic DNA sequence shown in Figure 6B. The 
. dividual jingle-stranded polynucleotides (N1-N8) are 

30 «. -< - *■ « 

ki nased by adding 1 «1 of each polynucleotide and 20 units 
^ T4 polynucl otide kinase to a solution containing 70 mM 
of T4 poiynuc 2ME, 500 

Tri6-HCL at pH 7.6, 10 mM MgCl 2 , 5 W ^x, 
35 xerograms P r ml of BSA. The soluti n was maintain d at 
3^c L 30 minutes and the reaction stopped by maintain- 
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The two end 


ing the solution at 65 *C for 10 minutes, 
polynucleotides 20 ng of polynucleotides Nl and poly- 
nucleotides N5 were added to the above kinasing reaction 
solution together with 1/10 volume of a solution contain- 
5 ing 20.0 mM Tris-HCL at pH 7.4, 2.0 mM MgCl 2 and 50.0 mM 
NaCl. This solution was heated to 70'C for 5 minutes and 
allowed to cool to room temperature, approximately 25'C, 
j_oyera^hours_J^_a^ 

time period all the polynucleotides annealed to form the 
10 double stranded synthetic DNA insert. The individual 
polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert with adding 40 ul of 
the above reaction to a solution containing 50 ul Trxs- 
HCI» at pH 7.5, 7 mM MgCl 2 , 1 mM DTT, 1 mM ATP and 10 units 
15 to T4 DNA ligase. This solution was maintained at 37'C 
for 30 minutes and then the T4 DNA ligase was Inactivated 
by maintaining the solution at 65 -C for 10 minutes. The 
end polynucleotides were kinased by mixing 52 ul of the 
above reaction, 4 ul of a solution recontaining 10 mM ATP 
20 and 5 units of T4 polynucleotide kinase. This solution 
was maintained at 37"C for 30 minutes and then the T4 
polynucleotide kinase was inactivated by maintaining the 
solution at 65 'C for 10 minutes. The completed synthetic 
DNA insert was ligated directly into a lambda Zap II 
25 vector that had been previously digested with the restric- 
tion enzymes Not I and Xhq I. The ligation mixture was 
packaged according to the manufacture ' s instructions using 
Gigapack II Gold packing extract available from Stratagene 
Cloning Systems, La Jolla, CA. The packaged ligation 
30 mixture was plated on XLl-Blue cells (Stratagene Cloning 
Systems, La Jolla, CA) . Individual lambda Zap II plaques 
were cored and the inserts excised according to the in 
vivo excision protocol provided by the manufacturer, 
Stratag ne Cloning Systems, La Jolla, CA and described in 
35 Short t il., Unrein • 11=7583-7600 (1988). 

This jn vivo excision prot col moves the cloned insert 
from the lambda Zap II vect r into a phagemid vector to 
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al lov easy manipulation and sequencing and also produces 
the phagemid version of the V t expression vectors. The 
accuracy of the above cloning steps was conf irmed by 
sequencing the insert using the Sanger dideoxide method 
5 described by Sanger et al., frnr Hitl , 

74:5463-5467, (1977) and using the manufacturer's xnstruc- 
^;„s in the AMV reverse transcriptase »S-dATP sequencing 
kit from Stratagene Cloning-Systems— La-Jolla^CA.__The_ 
sequence of the resulting V t -expression vector is shown in 

10 Figure 6 and Figure 8. 

The V L expression vector used to construct the V L 
Xibrary was the phagemid produced to allow the DNA of the 
V expression vector to be determined. The phagemid was 
produced, as detailed above, by the An_vJ*o excision 

15 process from the Lambda Zap V L expression vector (Figure 
8, . The phagemid version of this vector was used because 
the Nco I restriction enzyme site is unique in this 
version a nd thus could be used to operatively lixOced the 
V L DNA homologs into the expression vector. 

, Q 12 v. TT-FYDres ^nn vector Construction 

* to express the plurality of V t -coding DNA homologs in 
an E. coli host cell, a vector was constructed that placed 
the v L -coding DNA homologs in the proper reading frame, 
provided a ribosome binding site as described by Shine et 
25 al Nature . 254:34, 1975, provided the Pel B gene leader 
sequence that has been previously used to successfully 
secrete Fab fragments in B,,wU by Lei et al., J, Bag. . 
169 :4379 (19B7) and Better et : al 


35 


169: 43 ' y IJ-so*; ~- - 

(1988), .and also provided a polynucleotide containing a 
restriction endonuclease site for cloning. A synthetic 
DNA sequence containing all of the above polynucleotides 
and features was constructed by designing single stranded 
polynucleotide segments of 20-60 bases that would hybrid- 
Le to each other and form th double stranded synthetic 
DNA sequence shown in Figure 10. The sequence of each 
individual single-stranded polynucleotides (01-08, within 
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the double stranded synthetic DNA sequence is shown in 
Table 10. 

Polynucleotides 02, 03, 04, 05, 06 and 07 were 
kinased by adding 1 ul (0.1 ug/ul) of each polynucleotide 
and 20 units of T4 polynucleotide kinase to a solution 
containing 70 mM Tris-HCL at P H 7.6, 10 mM magnesium 
chloride (MgCl) , 5 mM dithiothreitol (DTT) , 10 mM 2- 
mercaptoethanol (2ME) , 500 micrograms per ml of bovine 
serum albumin. The" solution was maintained at 37*C for 30 


10 minutes and the reaction stopped by maintaining the 
solution at 65'C for 10 minutes. The 20 ng each of the 
two end polynucleotides, 01 and 08, were added to the 
above kinasing reaction solution together with 1/10 volume 
of a solution containing 20.0 mM Tris-HCl at pH 7.4, 2.0 

15 mM MgCl and 15.0 mM sodium chloride (NaCl) . This solution 
was heated to 70 'C for 5 minutes and allowed to cool to 
room temperature, approximately 25*C, over 1.5 hours in a 
500 ml beaker of water. During this time period all 8 
polynucleotides annealed to form the double stranded 

20 synthetic DNA insert shown in Figure 9. The individual 
polynucleotides were covalently linked to each other to 
stabilize the synthetic DNA insert by adding 40 ul of the 
above reaction to a solution containing 50 ml Tris-HCl at 
pH 7.5, 7 ml MgCl, 1 mm DTT, 1 mm ATP and 10 units of T4 

25 DNA ligase. This solution was maintained at 37 *C for 30 
minutes and then the T4 DNA ligase was inactivated by 
maintaining the solution at 65'C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 ul of the above 
reaction, 4 ul of a solution containing 10 mM ATP and 5 

30 units of T4 polynucleotide kinase. This solution was 
maintained at 37«C for 30 minutes and then the T4 poly- 
nucleotide kinase was inactivated by maintaining the 
solution at 65'C for 10 minutes. The completed synthetic 
DNA insert was ligated directly into a lambda Zap II 

35 vector that had b n previously digested with the restric- 
tion enzymes Not I and Xho I. The ligation mixture was 
packaged according to the manufacturer's instructions 
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using Gigapack II cold packing extract available, from 
^ratagene cloning systems. La Jolla, CX. The packaged 
Stratogei , cells (Stratagene 

ligation mixture vas plated on XL1 blue ce* \ 
Caning system. San Diego, CM- Individual lambda Sap II 
5 piques were cored and the inserts excised according to 
11 in vivo excision protocol provided by the manufac- 
turer, Stratagene Cloning Systems, La Jolla, CA. This in 
^ exci- s icn^rotocol,..oves^tha_ cloned insert rom t ^_ 
iLbda Sap II vector into a plasmid vector to -"°»J^ 
10 .ambulation and sequencing. The accuracy of the above 
10 ^nlng steps was confirmed by sequencing £. 

the manufacturer's instructions in the AMV Reverse 
Transcriptase "s-dATP sequencing kit from ^atagene 
cloning systems, I* Jolla, <*. The sequence of the 
15 resting V^I-expression vector is shown in Figure 9 and 
Figure 11. 


5- TGAATTCTAAACTAGTCGCCAAGGAGACAGTCAT 3« 
(103) 02) 5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3' 


STTATTACTCGCTGCCCAACCAGCCATGGCC 3 

• C 

(106) 05) 5 1 


20 (104) 03) 5' Gl 

(105) 04) 5' GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3' 


'CTCCTTGGCGACTAGTTTAGAATTCAAGCT 


GTATTTCATTATGACTGT 

3 


25 (107) 06) 5' CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 
(108) 07) 5 


TGACGAGCTCGGCCATGGCTGGTTGGG 3 


(109) 08) 
13 


TCGACGGCCGCTTAACTCTAGAAC 


v * v. l^j V'^yv construction, 

^T^epare an expression library enriched in V H 
30 sequences, DNA ho»ologs enriched in V H sequences were 
prepared according to Example 7 using the sa»e set of 5 
pLlrs but with primer 62A (Table 7, as the 
HZ homologs were then dig sted with the restriction 
enzy.es Xho I and S P e I and purified on a 1% agarose gel 
35 ZZ the standard electroelution t chnique described in 
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g iTni,r clor ^g * ^boratorv Manual, Maniatis et al. , 
ei., Cold spring Harbor, New York, (1982) . These 
prepared V„ DNA homologs were then directly inserted xnto 
the V„ expression vector that had been previously digested 

5 with Xho I and Spe I. 

The ligation fixture containing the V„ DNA homologs 
were packaged according to the manufacturers specifica- 
— tions-using-G-igapack-Gold^II-Packing_E j ctract (Stratagene 
Cloning Systems, La Jolla, CA) . The expression libraries 
10 were then ready to be plated on XL-1 Blue cells. 

Tb prepare a library enriched in V t sequences, PCR 
amplified products enriched in V L sequences were Prepared 
according to Example 7. The V L DNA homologs were digested 
with restriction enzymes Nco I and Spe I. The digested V L 
15 DNA homologs were purified on a -1% agarose gel using 
standard electrelusion techniques described in Molecular 
rn ^ir*,.* tebc r.^ Manual. Maniatis et al,, eds., Cold 
Spring Harbor, NY (1982). The prepared V L DNA homologs 
were directly inserted into the V L expression vector that 
20 had been previously digested with the restriction enzymes 
Nco I and spe 1. The ligation mixture containing the V L 
DNA homologs were transformed into XL-1 blue competent 
cells using the manufacturer's instructions (Stratagene 
Cloning systems. La Jolla, CA) . 

25 14. -1— v, .<*"'"" rm Tlnffmngf Tlltn y, Expression 
Vector 

in preparation for cloning a library enriched in V L 
sequences, PCR amplified products (2.5 ug/30 ul of 150 mM 
Nad, 8 mM Tris-HCl ( P H 7.5), 6 »M MgCl„ 1 mM DTT, 200 

30 ug/ml BSA at 37'C were digested with restriction enzymes 
Sac I (125- units) and Xba I (125 units) and purified on a 
1% agarose gel. In cloning experiments which required a 
mixture of the products of the amplification reactions, 
equal v lumes (50 ul, 1-10 ug c ncentrati n) of each 

35 reaction mixture were combined after amplification but 
before restricti n digestion. After gel electrophoresis 
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of the digested PCR amplified spleen mRNA, the region of 
the gel containing DNA fragments of approximately 350 bps 
was excised, electroeluted into a dialysis membrane, 
ethanol precipitated and resuspended in a TE solution 
containing 10 mM Tris-HCl P H 7.5 and 1 mM EDTA to a final 

concentration of 50 ng/ul. 

The V L II-expression DNA vector was prepared for 


cloning cy acuuj.*>-«*y 

containing 250 units each of the restriction endonucleases 
10 Sac 1 and Xba 1 (both from Boehringer Mannheim, 
mdianapolis, IN) and a buffer recommended by the manu- 
facturer. This solution was maintained at 37-C for 1.5 
hours. The solution was heated at 65-C for 15 minutes to 
inactivate the restriction endonucleases. The solution 

15 was chilled to 30'C and 25 units of heat-killable HK) 
phosphatase (Epicenter, Madison, WI) and CaCl 2 were admixed 
to it according to the manufacturer's specifications. 
This solution was maintained at 30'C for 1 hour. The DNA 
was purified by extracting the solution with a mixture of 

20 phenol and chloroform followed by ethanol precipitation. 
The V L II expression vector was now ready for ligation to 
the V. DNA homologs prepared in the above examples. 

DNA homolog enriched in V L sequences were prepared 
according to Example 6 but using a 5- light chain primer 

25 and 3. light chain primer shown in Table 9. Individual 
amplification reactions were carried out using each 5 
light chain primer in combination with the 3- light chain 
primer. These separate V L homolog-containing reaction 
fixtures were mixed and digested with the restriction 

30 endonucleases Sac 1 and Xba 1 according to Example 7. The 
V homologs were purified on a 1% agarose gel using the 
standard electrocution technique described in MolgsHlaE 
» T^tm Manual., Maniatis et al., eds.. Cold 
Spring Harbor, New York, (1982). These prepared V L DNA 

35 homologs were then directly inserted into the Sac 1 - Xba 
cl aved v.Il-expression vector that was prepared above by 
ligating 3 moles f V L DNA homolog inserts with each »ole 
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Tris-HCL at pH 7.5. The DNA was then ethanol precipitated 
and resuspended in 100 ul of water. This solution was 
admixed with 100 units of the restriction endonucl ease 
EcoR I (Boehringer Mannheim, Indianapolis, IN) in a final 
5 volume of 200 ul of buffer containing the components 
specified by the manufacturer. This solution was main- 
tained at 37 'C for 1.5 hours and the solution was then 
extracted with a mixture of phenol and chloroform. The 

10 TE. 

The restriction digested V H and V L II-expression 
Libraries were ligated together. The ligation reaction 
consisted of 1 ug of V„ and 1 ug of V L II phage library DNA 
in a 10 ul reaction using the reagents supplied in a liga- 
15 tion kit purchased from Stratagene Cloning Systems (La 
jolla, CA). After ligation for 16 hr at 4'C, l ul of the 
ligated the phage DNA was packaged with Gigapack Gold II 
packaging extract and plated on XL 1-blue cells prepared 
according the manufacturer's instructions. A portion of 
the 3X10 6 clones obtained were used to determine the 
effectiveness of the combination. The resulting V H and V t 
expression vector is shown in Figure 11. 

Clones containing both V H and were excised from the 
phage to pBluescript using the jn vitro excision protocol 
25 described by Short etal. ( Nnelelc Acid Research. 16L7583- 
7600 (1988) . Clones chosen for excision expressed the 
decapeptide tag and did not cleave X-gal in the presence 
of 2mM IPTG, thus remaining white. Clones with these 
characteristics represented 30% of the library. 50% of 
30 the clones chosen for excision contained a V„ and V L as 
determined by restriction analysis. Since approximately 
30% of the clones in the v„ library expressed the decapep- 
tide tag and 50% of the clones in the V t II library 
contained a V t sequence it was anticipated that no more 
35 than 15% of the clones in the combined library would 
contain both V H and V L clones. The actual number obtained 


20 
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was 15% of the library indicating that the process of 
combination was very efficient. Process of 

5 To segregate the individual clones containing DNA 

homologs that code for a V ant4 0 «« 

titre „ " antigen bi «ding protein, the 

™e ,r " ; XPrCSSi ° n lib «*y P-P«ed according to 
Example 12 was determined. Thi, n^rr tit^U.... 
performed using methods well known to ^ Z rT 

10 ser±ai diiutions - 

* ' 5 yeast extract, io a/r, 

amine (casein hydrolysate, and 0 .7* melted, 50-c aal ' 
^ phage, the bacteria and the top agar „ "e ; 

«tract, 10 g/l, H2 amir, (casein hydrolysate) and IS o/L 
Difco .gar. The pl ates vera maintained at 37-c for L to 
24 hours during which ttae period th. i„ M , 
20 developed on the hacteriai lawn' ^e Trl 
counted to determine the tota! number of p^gue form!" 
units per mi in the original library. ^ 

«. ■ T itred expresslon lib ™ry vas then plated out so 
that replica filters could be made from the Lbrary Z 
25 replica filters will be used to I,*., 

individual clones in thecal Tat °" ^ 

. . ^ A "»ary that are expressina the 

150 millimeter plate was added to 600 ul of ewonenti.,^ 
30 growing E^U cells end maintained at n-T^u^f 

The 7.5 mi of top agar was admixed to the solution 
contain^ the bacterial cells and the absorbed phage ^hd 

.S ?t\Z~7 ""^^ -my across the' sLa« 

33 or a prewarmed bacterial aoar t>i*+^ 

AttA agar plate. This process was 

repeated for a sufficient numb r of plates to plate ou^ 
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total number of plaques At least equal to the library 
size. These plates were then maintained at 37 'C for 5 
hours. The plates were then overlaid with nitrocellulose 
filters that had been pretreated with a solution contain- 
5 ing 10 nM isopropyl-beta-D-thiogalactopyanoside (IPTG) and 
maintained at 37 'C for 4 hours. The orientation of the 
nitrocellulose filters in relation to the plate were 
marked by punching a hole with a needle dipped in water- 
proof ink through the filter and into the bacterial plates 

10 at several locations. The nitrocellulose filters were 
removed with forceps and washed once in a TBST solution 
containing 20 mM Tris-HCl at pH 7.5, 150 mM NaCl and 0.05% 
monolaurate (tween-20) . A second nitrocellulose filter 
that had also been soaked in a solution containing 10 mM 

15 IPTG was reapplied to the bacterial plates to produce 
duplicate filters. The filters were further washed in a 
fresh solution of TBST for 15 minutes. Filters were then 
placed in a blocking solution consisting 20 mM Tris-HCl at 
P H 7.5, 150 mM NaCl and 1% BSA and agitated for 1 hour at 

20 room temperature. The nitrocellulose filters were trans- 
ferred to a fresh blocking solution containing a 1 to 500 
dilution of the primary antibody and gently agitated for 
at least 1 hour at room temperature. After the filters 
were agitated in the solution containing the primary 

25 antibody the filters were washed 3 to 5 times in TBST for 
5 minutes each time to remove any of the residual unbound 
primary antibody. The filters were transferred into a 
solution containing fresh blocking solution and a 1 to 500 
to a 1 to 1,000 dilution of alkaline phosphatase conju- 

30 gated secondary antibody. The filters were gently 
agitated in the solution for at least 1 hour at room 
temperature. The filters were washed 3 to 5 times in a 
solution of TBST for at least 5 minutes each time to 
remove any r sidual unbound secondary antibody. The 

35 filters were washed once in a solution containing 20 mM 
Tris-HCl at pH 7.5 and 150 mM NaCl. The filters were 
rem ved from this solution and excess moisture blotted 
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from them vith filter paper. The color was developed by 
placing the filter in a solution containing 100 mM Tris- 
HC1 at pH 9.5, 100. mM Had, 5 'mM MgCl 2 , 0.3 mg/ml of nitro 
Blue Tetrazolium (NBT) and 0.15 mg/ml of 5-bromo-4-chloro- 
5 3-indblyl-phosphate (BCIP) for at least 30 minutes at room 
temperature. The residual color development solution was 
rinsed from the filter with a solution containing 20 mM 
Tris-HCl at pH 7.5 and 150 mM NaCl. The filter was then 
—placed in-a-stop-solution-consisting-of-20 mM Tris-HCl at 
10 pH 2.9 and 1 mM EDTA. The development of an intense 
purple color indicates a positive result. The filters are 
used to locate the phage plague that produced the desired 
protein. That phage plague is segregated and then grown 
up for further analysis. 
15 Several different combinations of primary antibodies 

and second antibodies were used. The first combination 
used a primary antibody immunospecif ic for a decapeptide 
that will be expressed only if the V H antigen binding 
protein is expressed in the proper reading frame to allow 
20 read through translation to include the decapeptide epi- 
tope covalently attached to the V„ antigen binding protein. 
This decapeptide epitope and an antibody immunospecif ic 
for this decapeptide epitope was described by Green et 
al., Cell 28:477 (1982) and Niman et al., Proc? Natt ftcafl. 
25 '*"■<- P S- A. 80:4949 (1983). The sequence of the decapep- 
tide recognized is shown in Figure 11. A functional 
equivalent of the monoclonal antibody that is immuno- 
specif ic for the decapeptide can be prepared according to 
the methods of Green et al. and Niman et al. The secon- 
30 dary antibody used with this primary antibody was a goat 
antimouse IgG (Fisher Scientific). This antibody in 
immunospecif ic for the strand region of mouse IgG and did 
not recognize any portion of the variable region of heavy 
chain. This particular combination of primary and secon- 
35 dary antibodies wh n used according t the above protocol 
determined that between 25% and 30% of the clones were 
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expressing the decapeptide and therefore these clones were 
assumed to also be expressing a V H antigen binding protein. 

in another combination the anti-decapeptide mouse 
monoclonal was used as the primary antibody and an af f in- 
5 ity purified goat anti-mouse Ig, commercially available as 
part of the picoBlue immunoscreening kit from Stratagene 
Cloning System, La Jolla, CA, was use as the secondary 
antibody. This combination resulted in a large number of 
false positive clones because the secondary antibody also 

10 immunoreacted with the V, of the heavy chain. Therefore 
this antibody reacted with all clones expressing any V H 
protein and this combination of primary and secondary 
antibodies did not specifically detect clones with the V H 
polynucleotide in the proper reading frame and thus 

15 allowing expressing of the decapeptide. 

Several combinations of primary and secondary anti- 
bodies are used where the primary antibody is conjugated 
to fluorescein isothiocyanate (FITC) and thus the immuno- 
specificity of the antibody was not important because the 

20 antibody is conjugated to the preselected antigen (FITC) 
and it is that antigen that should be bound by the V H 
antigen binding proteins produced by the clones in the 
expression library. After this primary antibody has bound 
by virtue that is FITC conjugated mouse monoclonal anti- 

25 body P 2 5764 (ATCC IHB-9505) . The secondary antibody used 
with this primary antibody is a goat anti-mouse Ig (Fisher 
scientific, Pittsburgh, PA) conjugated to alkaline phos- 
phatase using the method described in antibodies: A 
T^^nT-v Manual. Harlow and Lowe, eds. , Cold Spring 

30 Harbor, New York, (1988) . If a particular clone in the V H 
expression, library, expresses a V H binding protein that 
binds the FITC covalently coupled to the primary antibody, 
the secondary antibody binds specifically and when devel- 
oped the alkaline phosphate causes a distinct purple color 

35 to form. 

The s cond combination of antibodies of th type uses 
a primary antibody that is FITC conjugated rabbit anti- 
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human IgG (Fisher Scientific, Pittsburgh, PA) . The secon- 
dary antibody used with this primary antibody is a goat 
anti-rabbit igG conjugated to alkaline phosphatase using 
- the methods described in A"* bodies ft Moratory Manual y 
5 Harlow and Lane, eds., Cold Spring Harbor, New York, 
(1988) . If a particular clone in the V B expression library 
expresses a V„ binding protein that binds the FITC conju- 

. gate* primary a ntibod y , the secondary antibody 

binds specifically and when developed the alkaline 

10 phosphatase causes a distinct purple color to form. 

Another primary antibody was the mouse monoclonal 
antibody p2 5764 (ATCC # HB-9505) conjugated to both FITC 
and 125 I, The antibody would be bound by any V„ antigen 
binding proteins expressed. Then because the antibody is 

15 also labeled with 12S I, an autoradiogram of the filter is 
made instead of using a secondary antibody that is conju- 
gated to alkaline phosphatase. This direct production of 
an autoradiogram allows segregation of the clones in the 
library expressing a V H antigen binding protein of 

20 interest. 

17. aggregatin g r>KA Homology For f» V.. 3*3 V| *hat Form an 
Antigen Binding F.. 

To segregate the individual clones containing DNA 
homologs that code for a V„ and V L that form an antigen 

25 binding F v , an V H and V t expression library was titred 
according to Example 15. The titred expression library 
was then screened for the presence of the decapeptide tag 
expressed with the V„ using the methods described in 
Example 16. DNA was then prepared from the clones to 

30 express the decapepide tag. This DNA was digested with 
the restriction endonuclease Pvu II to determine whether 
these clones also contained a V L DNA homolog. The slower 
migration of a PvuII restriction endonuclease fragment 
indicated that th particular clone contained b th a V„ and 

35 a V L DNA horn log. 


SUBSTITUTE 


SHEET 


WO 91/16427 


"1 

PCT/US91/02910 


113 


The clones containing" both a V„ and a V t DNA homolog 
" were analyzed to determine whether these clones produced 
an assembled F v protein molecule from the V H and V t DNA 

homologs. , . 

5 The F v protein fragment produced in clones containing 

both V H and V L was visualized by immune precipitation of 
radiolabeled protein expressed in the clones. A 50 ml 
culture of LB broth (5 g/L yeast extract, 10 g/L and tryp - 
tone 10 g/L Nad at pH 7.0) containing 100 ug/ul of 

XQ ampicillin was inoculated with E^coli harboring a plasmid 
contain a V H and a V L . The culture was maintained at 37 *C 
with shaking until the optical density measured at 550 nm 
was 0.5. The culture then was centrifuged at 3,000 g for 
10 minutes and resuspended in 50 ml of M9 media (6 g/L 

15 Na^PO,, 3 g/L KH 2 P0 4f 0.5 g/L Nad, 1 g/L NH 4 Cl, 2g/L 
glucose, 2 mM MgSO, and 0.1 mMgS0 4 CaCl 2 supplemented with 
amino acids without methionine or cysteine. This solution 
was maintained at 37'C for 5 minutes and then 0.5 mCi of 
as HSO, (New England Nuclear, Boston, MA) was added and 

20 the solution was further maintained at #6C for an addi- 
tional 2 hours. The solution was then centrifuged at 300 
x g and the supernatant discarded. The resulting bacter- 
ial cell pellet was frozen and thawed and then resuspended 
for 10 minutes and the resulting pellet discarded. The 

25 supernatant was admixed with 10 ul of anti-decapeptide 
monoclonal antibody and maintained for 30-90 minutes on 
ice. 40 ul of protein G coupled to sepharose beads 
(Pharmacia, Piscataway, NJ) was admixed to the solution 
and the added solution maintained for 30 minutes on ice to 

30 allow an immune precipitate to form. The solution was 
centrifuged at 10,000 x g for 10 minutes and the resulting 
pellet was resuspended in 1 ml of a solution containing 
100 mM Tris-HCl at Ph 7.5 and centrifuged at 10,000 x g 
for 10 minutes. This procedure was repeated twice. The 

35 r suiting immune precipitate pellet was loaded ont a 
PhastGel Homogenous 20 gel (Pharmacia, Piscataway, NJ) 
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according to the manufacturer's directions. The gel was 
dried and used to expose X-ray film. 

The resulting autoradiogram is shown in Figure 12. 
The presence of V L that was immunoprecipitated because it 
5 was attached to the V H -decapepide tag recognized by the 
precipitating antibody. 

18. Generation of a Combinatorial — Library— Of— the 
Immunoglobulin Re pertoire in Phage 

Vectors suitable for expression of V„, V L , Fv and Fab 

10 sequences are diagrammed in Figures 7 and 9. As previ- 
ously discussed, the vectors were constructed by modifi- 
cation of Lambda Zap by inserting synthetic oligonucleo- 
tides into the multiple cloning site. The vectors were 
designed to be antisymmetric with respect to the Not I and 

15 EcoR I restriction sites which flank the cloning and 
expression sequences. As described below, this anti- 
symmetry in the placement of restriction sites in a linear 
vector like bacteriophage allows a library expressing 
light chains to be combined with, one expressing heavy 

20 chains to construct combinatorial Fab expression 
libraries. Lambda Zap II V L II (Figure 9) is designed to 
serve as a cloning vector for light chain fragments and 
Lambda Zap II V„ (Figure 7) is designed to serve as a 
cloning vector for heavy chain sequences in the initial 

25 step of library construction. These vectors are engi- 
neered to efficiently clone the products of PCR 
amplification with specific restriction sites incorporated 
at each end. 

A. pgR AmPlif i cation of Antibody Fragments 
30 The PCR amplification of mRNA isolated from spleen 

cells with oligonucleotides which incorporate restriction 
sites into the ends of the amplified product can be used 
to clone and express heavy chain s quences including Fd 
and kappa chain sequences. The oligonucleotides primers 
35 used f r these amplifications are presented in Tables 1 
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and 2. The primers are analogous to those which have been 
successfully used in Example 6 for amplifications of V H 
sequences. The set of 5' primers for heavy chain ampli- 
fication were identical to those previously used to 
5 amplify V H and those for light chain amplification were 
chosen on similar principles, Sastry et al., Proc, Nat^. 

sci. USA . 80: 5728 (1989) and Orland et al., £roc 
^i, Bfei. PSA . 8G:3833 (1989). The unique 3' 

primers of heavy (IgGl) and light (k) chain sequences were 

10 chosen to include the cysteines involved in heavy-light 
chain disulfide bond formation. At this stage no primer 
was constructed to amplify lambda light chains since they 
constitute only a small fraction of murine antibodies. In 
addition, Fv fragments have been constructed using a 3« 

15 primer which is complementary to the to the mRNA in the J 
(joining) region (amino acid 128) and a set of unique 5» 
primers which are complementary to the first strand cDNA 
in the conserved N-terminal region of the processed 
protein . Restriction endonuclease recognition sequences 

20 are incorporated into the primers to allow for the cloning 
of the amplified fragment into a lambda phage vector in a 
predetermined reading frame for expression. 

B. y.-thrarv Construction 

The construction of a combinatorial library was 

25 accomplished in two steps. In the first step, separate 
heavy and light chain libraries were constructed in Lambda 
Zap II V H and Lambda Zap II V L II respectively. In the 
second step, these two libraries were combined at the 
antisymmetric EcoRl sites present in each vector. This 

30 resulted in a library of clones each of which potentially 
co-expresses a heavy and a light chain. The actual combi- 
nations are random and do not necessarily reflect the 
combinations present in the B-cell population in the 
parent animal. Lambda Zap II V„ expression vector has been 

35 us d to create a library of heavy chain sequences for DNA 
btain d by PCR amplifications of mRNA isolated from the 
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spleen of a 129 G lx + mouse previously immunized with p- 
nitrophenyl phosphonamidate (NPN) antigen 1 according to 
formula I (Figure 13) conjugated to keyhole limpet 
hemocyanin (KLH) . 
5 The NPN-KLH conjugate was prepared by admixture of 

250 ul of a solution containing 2.5 mg of NPN according to 
formula 1 (Figure 12) in dimethyl formamide with 750 ul of 
a solution containin g 2 m g of KLH in 0.01 M sodium phos- 
phate buffer (pH 7.2) . The two solutions were admixed by 
10 slow addition of the NPN solution to the KLH solution 
while the KLH solution was being agitated by a rotating 
stirring bar. Thereafter the admixture was maintained at 
4«C for 1 hour with the same agitation to allow conjuga- 
tion to proceed. The conjugated NPN-KHL was isolated from 
15 the nonconjugated NPN and KLH by gel filtration through 
Sephadex G-25. The isolated NPN-KLH conjugate was used in 
mouse immunizations as described in Example 3. 

The spleen mRNA resulting from the above immuniza- 
tions was isolated and used to create a primary library of 
20 V H gene sequences using the Lambda Zap II V H expression 
vector. The primary library contains 1.3 x 10 6 pfu and has 
been screened for - the expression of the decapeptide tag to 
determine the percentage of clones expressing Fd 
sequences. The sequence for this peptide is only in frame 
25 for expression following the cloning of an Fd (or V H ) 
fragment into the vector. At least 80% of the clones in 
the library express Fd fragments based on immuno-detection 
of the decapeptide tag. 

The light chain library was constructed in the same 
30 way as the heavy chain and shown to contain 2.5 x 10 6 
members. Plaque screening, using the anti-kappa chain 
antibody, indicated that 60% of the library contained 
expressed light chain inserts. This relatively small 
percentage of inserts probably resulted from incomplete 
35 dephosphorylation of vector after cleavage with Sac I and 
Xba I. 
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Once obtained, the two libraries were used to 
construct a combinatorial library by crossing them at the 
EcoRl site. To accomplish the cross, DNA was first 
purified from each library. The light chain library was 
cleaved with Mlul restriction endonuclease, the resultxng 
5 . ends dephosphorylated and the product digested with 
EcoRl. This process cleaved the left arm of the vector 
into several pieces but thejlght^rm^ontain^ng^ej^ 
^irsegu^e¥T^5alSed intact. In a parallel fashion, 
the DNA of heavy chain library was cleaved with HindHI, 
dephosphorylated and cleaved with EcoR I, destroying the 
right arm but leaving the left arm containing the heavy 
chain sequences intact. The DNA's so prepared were then 
combined and ligated. After ligation only clones which 
15 resulted from combination of a right arm of light chain- 
containing clones reconstituted a viable phage. After 
ligation and packaging, 2.5 x 10 7 clones were obtained. 
This is the combinatorial Fab expression library that was 
screened to identify clones having affinity for NPN. To 
determine the frequency the phage clones which co-express 
the light and heavy chain fragments, duplicate lifts o the 
light chain, heavy chain and combinatorial libraries were 
screened as above for light and heavy chain- expression, 
in this study of approximately 500 recombinant phage 
25 approximately 60% co-expressed light and heavy chain 
proteins. 

C. Antigen Binding 

All three libraries, the light chain, the heavy chain 
and Fab were screened" to determine if they contained 

30 recombinant phage that expressed antibody fragments bind- 
ing NPN. in a typical procedure 30,000 phage were plated 
and duplicate lifts with nitrocellulose screened for 
binding to NPN coupled to 125 I labeled BSA (Figure 15). 
Duplicate scr ens of 80,000 recombinant phage from th 

35 light chain library and a similar number from the heavy 
chain library did not id ntify any clones which b und th 
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antigen. In contrast, the screen of a similar number of 
clones from the Fab expression library identified many 
phage plaques that bound NPN (Figure 15) . This observa- 
tion indicates that under conditions where many heavy 
5 chains in combination with light chains bind to antigen 
the same heavy or light chains alone do not. Therefore, 
in the case of NPN, it is believed that there are many 
heavy and lig ht chains that only bind antigen when the y 
are combined with specific light and heavy chains 

10 respectively. 

To assess the ability to screen large numbers of 
clones and obtain a more quantitative estimate of the 
frequency of antigen binding clones in the combinatorial 
library, one million phage plaques were screened and 

15 approximately 100 clones which bound to antigen were 
identified. For six clones which were believed to bind 
NPN, a region of the plate containing the positive and 
approximately 20 surrounding bacteriophage plaques was 
"cored", replated, and screened with duplicate lifts 

20 (Figures 15) . As expected, approximately one in twenty of 
the phage specifically bind to antigen. "Cores" of 
regions of the plated phage believed to be negative did 
not give positives on replating. 

To determine the specificity of the antigen-antibody 

25 interaction, antigen binding was competed with free 
unlabeled antigen as shown in Figure 16. Competition 
studies showed that individual clones could be distin- 
guished on the basis of antigen affinity. The concentra- 
tion of free hapten required for complete inhibition of 

30 binding varied between 10-100 x 10- 9 M suggestion that the 
expressed Fab fragments had binding constants in the 
nanomolar range. 

D . mrnnosition of th* clones and Their Expressed 
Products 

35 In preparation for characterization of the pr t in 

products able t bind NPN as d scrib d in Example 19C, a 
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plasmid containing the heavy and light chain .genes was 
excised fro* the appropriate "cored" bacteriophage plague 
using M13m P 8 helper phage. Mapping of the excxsed plasmid 
demonstrated a restriction pattern consistent with incor- 
poration of heavy and light chain sequences. The protein 
products of one of the clones was analyzed by ELISA and 
Western blotting to establish the composition of the NPN 
binding protein. A -bacterial-supemate-f ol-lowing IPT6- 
induction was concentrated and subjected to gel filtra- 
tion. Fractions in the molecular weight range 40.60 kD 
were pooled, concentrated and subjected to a *«r««M£ 
filtration separation. As illustrated in Figure 17, ELISA 
analysis of the eluting fractions demonstrated that NPN 
binding was associated with a protein of molecular weight 
about 50 kD which immunological detection showed contained 
both heavy and light chains. A Western blot (not shown) 
of a concentrated bacterial supemate preparation under 
non-reducing conditions was developed with anti- 
decapeptide antibody. This revealed a protein band of 
aolecular weight of 50 kD. Taken together these results 
are consistent with NPN binding being a function of Fab 
fragments in which heavy and light chains are covalently 
linked. 

20. fi r ~™,nbin a sP-™^1VSed Peromh jnstjpn 

Experiments directed to the An vivo recombination of 
two lambda vectors using flp recombinase-catalyzed recom- 
bination are described. The flp recombination site was 
introduced into the phage vectors using 39mer synthetic 
oligonucleotides. The sequence of the flp site utilized 
for recombination was derived from several references 
(e . q / senecoff PESS, Nj^ A^ SsU HS* 82:7220- 

7224 (1985) ) . The Xbal site within the 8bp core was elim- 
inated as this site was to be used in the cloning 
strategy. This was accomplished by making a p int muta- 
35 tion which has li<*le r no effect on its ability to allow 
rec mbination (McLeod et_al., MeJU. SSlU fllfil* 6*3357- 
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3367 (1985)). However, this point nutation is not 
required for the system to function. The oligonucleotides 
were further designed to be inserted in the EcoRl sites of 
the Lambda Zap II V„ and Lambda Zap II V L vectors so that 
only one flanking EcoRl sites would be regenerated (see 
Figure 18). The flanking sequences are not essential to 
the system. 


The following sequences were inserted into Lamba Zap II V H : 

(110) Oligo 7 9 E c o R 1 
10 AATTCGAAGTTCCTATTCTCTAAAAAGTATAGGAACTTC 3« 

(111) Oligo 80 GCTTCAAGGATAAGAGATTTTTCATATCCTTGAAGTTAA 
51 

The following sequences were inserted into Lambda Zap II 
V L : 

15 (112) O 1 i 9 08 1 

AATTGAAGTTCCTATTCTCTAAAAAGTATAGGAACTTCG EcoRl 3« 
(113) Oligo 82 CTTCAAGGATAAGAGATTTTTCATATCCTTGAAGCTTAA 

5« 

Vectors were constructed as follows. The first two 
20 oligonucleotides were mixed (0.5 pg oligo 79, 0.5 ng oligo 
80, 1 Ml 200 mM Tris, pH 7.4, 20 mM MgCl, 500 mM NaCl, and 
H 2 0 to 10 nl), heated to 85'C 5 min., and allowed to cool 
to room temperature \ over 1 hour in a water bath. The 
procedure was repeated using oligos 81 and 82. 
25 Ligation into - vector arms was accomplished by 

digesting Lambda Zap V, and Lambda Zap II V L with 3U/jxg 
EcoRl according to standard digestion procedure. After 
phenol/chloroform extraction, DNA was precipitated with 
EtOH. The vector was not phosphatase treated so that the 
30 oligonucleotides could be inserted without kinase treat- 
ment, thus preventing multiple tandem oligonucleotide 
inserts. Ligations were performed in 5 ul volumes using 
1 fit} of lambda DNA and 1 ng of annealed oligonucleotides 
acc rding to standard ligation protocol (se Maniatis et 
35 sXu, supra ) . Ligation mixes were packaged using Gigapack 
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Gold™ (Stratagene Cloning Systems, San Diego, CA) 
according to the protocol recommended in the manual. 

Following packaging, the vectors were screened. 
Packaged DNA was plated according to the Gigapack Gold 
5 manual procedure on NZY agar with approximately 400 pfu 
per 100 mm Petri dish. Duplicate plaque lifts were done 
according to the protocol in the Predige sted ZapII Cloning 
Manual (Stratagene Cloning Systems, San Diego, CA) on 
nitrocellulose filters, Denaturation and fixation of DNA 

10 onto the membranes is also described in the manual. 
Prehybridization was performed according to pBluescript II 
Exo/Mung DNA Sequencing System™ instruction manual 
(Stratagene Cloning Systems, San Diego, CA) for oligo- 
nucleotide probes (pg 6). Hybridization was performed 

15 overnight using 32 P kinased oligo 79 (0.5 x 10 cpm/ml) 
according to the pBluescript manual (Stratagene Cloning 
Systems, San Diego, CA) . Oligo 79 was kinased using 
standard 32 P gamma ATP labelling techniques (see Maniatis 
et al., supra ) . Filters were washed in 6X SSC, 0.1% SDS, 

20 3 times at room temperature, once at 55'C and finally at 
59-C Each was washed for approximately 10 minutes. 
Positive plaques were identified using X-ray autoradio- 
graphy. Twelve duplicate plaques were cored in 500 Ml SM, 
20 Ml chloroform. These plaques were sufficiently well 

25 isolated that secondary screening was not required. The 
cored plaques were excised according to the Predigested 
Zap II Cloning Manual (Stratagene Cloning Systems, San 
Diego, CA) and DNA from single ampicillin resistant 
colonies was sequenced using minipreped DNA and the T7 and 

30 T3 primers according' "to the DSK 35S Sequencing kit 
(Stratagene Cloning Systems, San Diego, CA) . Clones with 
flp sites in the correct orientation and opposite orien- 
tation were identified, amplified and titred. One of each 
type of clon (FLPHC+, FLPHC-, FLPLC+, FLPLC-) was used to 

35 test in yixfi flp- * diated recombination. 

in vivo flp-m diated recombination was accomplished 
as f llows. Flp recombinas was expr ss d off th tac 
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prom ter on a plasmid, P CS 3/ in p. coU MM294 strain 
(Lebreton et al. 1988) . This is a low copy number plasmid 
with the pACYC origin of replication and contains a 
chloramphenicol resistance gene. 

5 x 10 8 cells were coinfected with FLPHC and FLPLC 
vectors at an moi of 5 and 10 pfu each per cell, 
combinations of FLPHC+ and FLPLV+, or FLPHC- and FLPLC- , 
-or-FLPHG+-and-FLPLC«^-were_tested^ 


10 


overnight cultures of MM294(pCS3) were grown in LB, 
spun down and resuspeiided in 10mM MgS0 4 at a density of 
1.0. The appropriate amounts of phage were added 


OD, 


'600 


to 0.5 ml of cells and allowed to adhere at 37 *C for 15 
minutes. 50 ml of NZY was added to each flask and incu- 
bated for 2 hours with shaking. 250 fil of chloroform was 
15 added to 25 ml of lysate and incubated for 15 minutes at 
room temperature. The supernatants were titred and 
screened for phage containing both Lambda Zap II V H left 
arms and Lambda Zap II V t right arms. Probes to identify 
zap II V H left arms and Lambda Zap II V t right arms were 
20 designed by identifying unique sequences from the known 
sequence of the vectors. 

The Lambda Zap II V H left arm probe had the following 

sequence: 

(114) CTAGTTACCCGTACGACCCCCCCGTTCCGGACTACGCTTCTTAATAG 3 « 
25 This sequence hybridizes to the decapeptide sequence of 

the Lambda Zap II V H . The Lambda Zap II V t right arm probe 
had the following sequence: 

(115) 5* GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3« 

This sequence hybridizes to the sequence from the Sacl 
30 site to the former Notl site of the Lambda Zap II V L 
vector. 

The screening procedure used was the same as that- 
used to identify the flp vectors, as described above, with 
the exception of washing conditions. Filters were washed 
35 with 6XSSC, 1%SDS 3 times at room temperature and twice at 
60 *C. Plaques which hybridized t b th pr bes were 
identified by X-ray autoradiography, cored, excised and 
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15 


20 


25 


30 


digested to determine if recombination had occurred, 
control Pia^s identified as hybridising to only one 
plob. and to neither probe were also cored Diagnostic 
Restriction digests were PvuII, Prt, »*. Xhol/Pvul, 
sac" Sac/PVul. Hotl. Xbal, Seal, Spel, Spel/Puul 
^triction digest results verified that recombination at 
tte flP site occurred In yiyo. in cells expressing the tip 
^oSLse gene and ^ih^ntrol-S^a^^i ceXVs_ 
(Stratagene cloning systems, San Diego, CM wnxch do not 
normally express flp recombinase. 

Efficiency of recombination according to the number 
of plagues, identified as hybridising to bot* .probe, . was 
Initially between about 5-10%. Changes to the protocol 
^„ be made, however, which will improve & 
recovery of recombined vectors. For example, by adding 
reikis marKer sequences to the left and right arms of 
the vectors, up to 100% of target recombinants can be 
Identified (Figure SO) . Adding selection 
ensure that all recombinants contain inserts will also 
Increase the efficiency of identifying the desired clones. 
• xn Example 19 a relatively • restricted library was 
prepared because only a limited number of primers were 
used for PCR amplif ication of Fd sequences. The library 
ls expected to contain only clones expressing kappa/gamma 
segules. However, this is not an inherent Imitation of 
method since additional primers can be «"***» 
amplify any antibody class or subclass. Despite thxs 
restriction we were able to isolate a large number of 
Intigen binding clones. Of interest is how a phage 
library prepared as described herein compares with the in 
^tLody repertoire in terms of size, characteristics 
of diversity, and ease of access. 

The size of the mammalian antibody «P«^« *• 
difficult to judge but a figure of the order of 10-10 
different antigen sp cificlties is often qu ^ted. Witt 
so me of th r servati ns discussed below, a phage lib»ry 
of this size or large can readily » constructed by a 
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modification of the current method. In fact once 
initial combinatorial library has been constructed, heavy 
and light chains can be shuffled to obtain libraries of 
exceptionally large numbers. 
5 In principle, the diversity characteristics of the 

naive (unimmuhized) In vivo repertoire and corresponding 
phage library are expected to be similar in that both 
involve a random combination of heavy and light chains. 
However, different factors will act to restrict the 
10 diversity expressed by an in vlyo repertoire and phage 
library. For example a physiological modification such as 
tolerance will restrict the expression of certain anti- 
genic specif icities from the in vjvo repertoire but these 
specificities may still appear in the phage library. For 
15 example, the representation of mKNA for sequences 
expressed by stimulated B-cells can be expected to 
predominate over those of unstimulated cells because of 
higher levels of expression. Different source tissues 
( e.g. . peripheral blood, bone marrow or regional lymph 
nodes) and different PCR primers ones expected to 

amplify different antibody classes) may result in library 
with different diversity characteristics. 

Another difference between jn vlyo repertoire and 
phage library is that antibodies isolated from the former 
25 may have benefited from affinity maturation due to somatic 
mutations after combination of heavy and light chains 
whereas the latter randomly combines the matured heavy and 
light chains. Given a large enough phage library derived 
from a particular in vivo repertoire, the original matured 
heavy and light chains will be recombined. However, since 
one of the potential benefits of this new technology is to 
obviate the heed for immunization by the generation of a 
single highly diverse "generic" phage library, it would be 
useful to have methods to optimize sequences to compensate 
35 for the. absence of somatic mutation and clonal selection. 
Three procedures are made readily available thr ugh the 
methods of the pr sent invention. First, saturation muta- 
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genesis may be performed on the CDR's and the resulting 
Fabs can be assayed for increased function- Second, a 
heavy or a light chain of a clone which binds antigen can 
be recombined with the entire light or heavy chain 
libraries respectively in a procedure identical to the one 
used to construct the combinatorial library. Third, iter- 
ative cycles of the two above procedures can be performed 
to further opt -iaize-the-affinity-or-catalytic-properties 


tU J~ ui biic^ w^w*»*i#^ - 

of the immunoglobulin. It should be noted that the latter 
10 two procedures are not permitted in B-cell clonal selec- 
tion which suggests that the methods described here may 
actually increase the ability to identify optimal 
sequences. 

Access is the third area where it is of interest to 

15 compare the in vivo antibody repertoire and phage library, 
in practical terms the phage library is much easier to 
access. The screening methods allow one to survey at 
least 50,000 clones per plate so that 10* antibodies can be 
readily examined in a day. This factor alone should 

20 encourage the replacement of hybridoma technology with the 
methods described here. The most powerful screening 
methods utilize selection which may be accomplished • by 
incorporating selectable markers into the antigen such as 
leaving groups necessary for replication of auxotrophic 

25 bacterial strains or toxic substituents susceptible to 
catalytic inactivation. There are also further advantages 
related to the fact that the in_viy£ antibody repertoire 
can only be accessed via immunization which is a selection 
on the basis of binding affinity. The phage library is 

30 not similarly restricted. For example, the only general 
method to identify antibodies with catalytic properties 
has been by pre-selection on the basis of affinity of the 
antibody to a transition state analogue. No such restric- 
tions apply to the in vivo library where catalysis can, in 

35 principl , be assayed directly. The ability to dir ctly 
assay large numbers f antibodies for function may allow 
6 el ction for catalysts in reactions wh re a mechanism is 
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not veil defined or synthesis of the transition state 
analog is difficult. Assaying for catalysis directly 
eliminates the bias of the screening procedure for 
reaction mechanisms pejorative to a synthetic analog and 
therefore simultaneous exploration of multiple reaction 
pathways for a given chemical transformation are possible. 

Although we have given examples of several screening 
methods, it should be clear to one skilled in the art that 


meuiuuDf jlw o*w — — ■ 

alternative methods of screening, usch as by panning dels 
10 or particles expressing the protein product on their sur- 
face would essentially be equivalent. If the expressed 
gene products of interest are RNA molecules instead of 
proteins, screening could be accomplished by nucleic acid 
hybridization or by detecting some functional property of 
15 th eRNA, such as ribozyme catalysis. 

The methods disclosed herein describe generation of 
Fab fragments which are clearly different in a number of 
important respects from intact (whole) antibodies. There 
is undoubtedly a loss of a affinity in having monovalent 
20 Fab antigen binders but this can be compensated by selec- 
tion of suitably tight binders. For a number of applica- 
tions such as diagnostics and biosensors it may be prefer- 
able to have monovalent Fab fragments. For applications 
requiring Fc effector functions, the technology already 
25 exists for extending the heavy chain gene and expressing 
the glycosylated whole antibody in mammalian cells. 

The ideas presented here address the bottle neck in 
the identification and evaluation of antibodies. It is 
now possible to construct and screen at least three orders 
30 of magnitude more clones with mono-specificity than previ- 
ously possible. The potential applications of the method 
should span basic research and applied sciences. 

2i. m < r^m^leo-M^ Primer _pesj.gn for Product 

p^eistro r»ic DNA 

35 A method based on PCR amplification that fuses heavy 

and light chain sequences has been used t c nstruct a 
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complete antigen binding domain of a Fab protein fragment 
composed of a heavy and a light chain. Schematic diagrams 
of an immunoglobulin molecule composed of heavy and light 
chains containing constant and variable regions is shown 
5 in Figure 1. Human heavy chain IgG and human kappa light 
chain are diagrammatically sketched in Figures 2A and 2B, 
respectively. To accomplish this procedure, immunoglobu- 
iin-heavy-and light chain-primers were-designed to produce^ 

a region of homology between two polymerase chain reaction 

10 (PCR) products. The complementary regions have been shown 
to hybridize predominantly under conditions where one set 
of primers ("inside primer pair") is used in a limiting 
amount relative to the other set of primers ("outside 
primer pair") . After the .3 • ends of the PCR products have 

15 hybridized, the DNA polymerase has been shown to extend 
the ends creating a fusion sequence carrying the unique 
sequences of both PCR fragments separated by one copy of 
region X cistronic bridge. A two-step cloning procedure 
is thus avoided. When the recombinant sequence is then 

20 inserted into an expression vector such as XmmunoZAP, a 
fusion production capable of simultaneously expressing the 
heavy and light chains can be produced. 

The strategy used for producing immunoglobulin heavy 
and light chain PCR dicistronic DNA is shown schematically 

25 in Figure 21. Regions of the immunoglobulin heavy chain 
coding strand are designated V H , C„l, C„2, and C H 3 corres- 
ponding to functional regions in the protein. The corres- 
ponding regions of the non-coding strand are designated by 
a prime ( ■ ) . Regions V L and C L are similarly labelled for 

30 the kappa light chain. This procedure can also be per- 
formed using lambda light chain specific regions. A 
region, X, unrelated to the natural immunoglobulin 
sequences, is introduced into the fusion product by 
attaching X to the 5- ends of both of the C„l' and V L 

35 inside primers. 

Overlapping oligonucleotid . primers used in the 
fusion-PCR reactions t produce dicistronic DNA were 
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designed to encode the following: amino acids of 225 to 
230 of the igc heavy chain hinge region which are common 
to all human ZgG isotypes; an Spe I restriction site; two 
stop codons; a ribosome binding site; a periplasmic (pelB) 
leader sequence (Better, et al., SsiSLSZ, 240:1041-1043 
(1988); Lei, etal., ,T. Bacterid-, 169:4379-4383 (1988) ) ; 
a Sac I restriction site which encodes amino acids 1 and 
„2_of the mature kappa^igMLJ^^^ 
of the mature kappa light chain. The X region was 
designed to contain a ribosome binding site and a pelB 
leader to ensure expression of the light chain. 
Nucleotide sequences for all human and mouse PCR primers, 
both inside and outside, are listed in Table 11. Primers 
followed by a prime (•) represent non-coding strand 
15 sequences. 

Tf»hie 11 

Human and Mouse PCR Primers 
Seq. 

j&~ No. Human 

20 (117) V„ 5 • -GTCCTGTCCGAGGTGCAGCTGCTCGAGTCTGG-3 ' 

(118) c"l« 5«-AATAACAATCCAGCGGCTGCCGTAGGCAATAGGT 

ATTTCATTATGACTGTCTCCTTGCTATTAACTAG 
TACAAGATTTGGGCTC— 3 1 

(119) V L 5 • -GCCTACGGCAGCCGCTGGATTGTTATTAATCGCT 
25 L GCCCAACCTGCCATGGCTGAGCTCGTGATGACCC 

CAGTCTCC-3 ' 

(120) <V 5 ' -TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA 

GCTCTTTGTGACGGGCGAACTC-3 ' 

Mouse ' 
30 (121) V fl 5 * -AGGTCCAGCTGCTCGAGTCTGG-3 ' 

(122) C H 1' 5 ' -AATAACAATCCAGCGGCTGCCGTAGGCAATAGG 

TATTTCATTATGACTGTCTCCTTGCTATTAACT 
AGTATACAATCCCTGGGCACAAT-3 • 

(123) V L 5 ' -GCCTACGGCAGCCGCTGGATTGTTATTAATCGC 

TGCCCAACCTGCCATGGCTGAGCTCGTGATGAC 

CCAGTCTCC-3 • 


35 
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5 < -TCCTTCTAGATTACTAACACTCTCCCCTGTTGAA-3 


The overlapping regions of the human C H 1' inside and 
V inside primers are illustrated in Figure 22. The heavy 
5 chain downstream C„l' inside primer sequence is written 3- 
to 5' and the light chain upstream V L inside primer 
sequence is written 5» to- 3'. The complementary PGR 
product strands^andn^^ 

txTcreate^thTdicistronic molecule. Bold nucleotides 

10 represent regions where the 0,1' inside primer hybridizes 
to the 3 • end of C„l on human igG heavy chain mRNA or where 
the V L inside primer hybridizes to the 5* end of V t frame- 
work on human kappa light chain cDNA. The amino acid and 
nucleotides in italics represent changes in sequence from 

15 the original pelB leader sequence. 

At amino acid 15 of the pelB leader sequence, the 
codon was changed from CTC to ATC resulting in a conserv- 
ative amino acid change from a leucine to an isoleucine as 
shown in Figure 22 and Table 11.. Hydrophobic amino. acids 

20 in the core region of periplasmic leader sequences have 
been shown to be essential for correct processing of the 
leader sequence and transport of the mature protein to the 
periplasm. Oliver, in Neidhardt, R.C. <ed.), Escherichia 
^ aT ,„ SKlm^ T- ^^urlum.. Am. Soc. Microbiol., 

25 1-56-69 (1987) . The nucleotide changes were made to allow 
for the artif actual insertion of one or two dATPs at the 
3 . end of the overlapping dicistronic molecules. Thermus 
amifiiicus (Tag) DNA polymerase may add a dATP to the 3- 
end of the PCR product because of terminal transferase 

30 activity. Jiang, etg al. fincogene, 4:923-928 (1989). The 
additional dATP would then cause a mismatch between the 
overlapping PCR products at the 3' terminus and inhibit 
elongation by Taq DNA polymerase, sommer, et al. msl^ 
Acidg^es^, 17:6749 (1989). Therefore, the change to two 

35 dTTPs in this position of the oligonucleotide primers 
would allow proper base pairing if up to two dATPs were 
added to th 3« terminus of th heavy chain PCR product. 
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Tne Kappa light chain PCB product was designed to termi- 
nate at a position where two dTIPS occur ox the end of 
Z Product and did not retire alterations of ^nucleo- 
tide sequence. Nucleotides were changed in the Kappa 
-, Ught chain primer encoding the pelB leader sentence with- 
, ligm cuoj. f . ^ order to decrease 

out introducing amino acid changes tn or _ 
tne number of mismatches between the primer and the leader 
-se^ence-of^^ 

o ^ were synthesized on an Biosystems 

* „ . , , flU knowing the manufacturer's 

DNA synthesizer, Model 38 1A, following «i 

instructions. 

Cloned DHX, previously isolated from a 
library that encodes human Fab fragments which bind 
«tan«s toxoid (TT) was used as a template <°™"£> 
a v,-and V t -codln, repertoire. Mullinax, et .1., 
20 Briefly, tte combinatorial library was prepared by the 
foxing approach, volunteer donors, who had been pre- 
viously LIi*ed against tetanus but had not received 
booster injections within the last year, received injec- 
tions on 2 consecutive days of 0.5 milliliters (ml, of 
25 alum-absorbed tetnus toxoid (TT) (40 microgram,.! (ug,/ml> 
(Connaught Laboratories, swiftweter, Pennsylvania) . 
( on! hundred ml of blood was drawn from the volunteers 
6 da ys post injection and anticoagulated with a mixture of 
0 !4 M citric acid, 0.2 M trisodiu. citrate, and 0.22 H 
30 oe*trose. The peripheral blood lymphocyte. (FBIjS) were 
ttcovered and isolate from the whole blood by layering. 
Z whole blood on Bistopaque-1077 (Sigma. St. Wuis 
Missouri) and centrifuge at 400 x g for 30 minute, at 25 
degrees Celsius (25-C). Isolated FBI* were washed twice 
35 wtl. Phosphate buffered saiine (PBS) (ISO m .sodium 
chl ride and 150 mM sodium phosphate, pH 7.2 at 25-C) . 

SUBSTITUTE 


PCI7US91/02910 

WO 91/16427 


131 

Total BHA was then filled from the PBLs (10* B cells 
per ml blood per 100 ml of blood, for an ennched source 
!t B-cell mRNA coding for antiTT iga using an FNA isola- 
l n lit according to manufacturer's instructions 
tion Kit cali£o mia) and also described by 

(stratagene. La Jolla, Callfomi j .„.,«-.- (198 7). 
ChomczynsKi «t al., M i l iMoPhSB. . ,„ l^or a 

-tefly, the isolated PBLs were homogenized in 10ml of a 


denaturing solute" — - '»%»»♦■•.- 

a te. '^r r. b je. 

££; Ten ml of Phenol ? « - J^^-^ 
a ted with H,0 was also admixed to 

atea wi U1 2 . ^ 0 f a chloroform: 

^tztjz-t^ - added to h omo- 

" ' ^'T.S - — vigorously for ten 
genate. The ixo» y minutes . The homo- 

seconds and maintained on ice for 15 mum 
aenlte was then transferred to a thick-walled 50 ml 
Propylene centrifuged tuhe (Fisher Scientific Company 

poiypropy* ,,„„<=,! The solution was centrifuged 

20 Pittsburgh, Pennsylvania). ^The sol ^ 

at 10,000 x g for 20 minutes at 4 c. ™ , pp .„ . 
containing agueous layer was transferred to a fresh 50 ml 
Powropylenf centrifuge tube and mixed with an ^eaual 
volume oi isopropyl alcohol. This solution was 

25 at -20-C for at least one hour to precipitate the *NA. 

25 a-c ^Y-eetnitated RNA was centri- 

The solution containing the precipitated . 

* a »t io ooo x g for twenty minutes at 4 C. The 
!:Heted total cellular 1» «• «"-ted and dissolved in 
3 ml of the denaturing solution described above. Three ml 
„ of isopropyl alcohol was added to the re-suspended total 
30 of isopropyl solution was main- 

cellular RNA and inverted to mix. i«" ..,-.„. 
!<ned at -20-c for at least 1 hour to precipitate the 
^ Z solution containing the precipitated PNA was 
~ntrif7,ed at 10,000 x g f or t n minutes at 4-C. The 
36 £ ^ was wash d once with a solution containing 

« % ethanol. The pelleted MA was dried under vacuum f r 
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15 minutes and then re-suspended in diethyl pyrocarbonate 
(DEPC) treated (DEPC-H 2 0) 1^0). 

Messenger SNA (mRNA) was prepared from the total 
cellular RNA using methods described in M o ^mar QonllW 
. T ^o^t nT -v Manual, Maniatis et al., eds., Cold Spring 
Harbor, NY, (1982). Briefly, 500 mg of the total RNA 
isolated from a FBI* prepared as described above was re- 
suspended in one ml of IX sample buffer (1 mM Tris-HCl, 


10 


15 


20 


25 


30 


35 


(Tris [hydroxylmethyl-aminomethane]) pH 7.5; 0.1 mM EDTA 
(disodium ethylene diamine tetra-acetlc acid) , 0.5 M NaCl) 
and maintained at 65'C for five minutes and then 
for five more minutes. The mixture was then applied to an 
oligo-dT (Stratagene) column that was previously prepared 
by washing the oligo-dT with a solution containing 10 mM 
Tris-HCl, P H 7.5; 1 mM EDTA, 0,5 M NaCl. The eluate was 
collected in a sterile polypropylene tube and reapplied to 
the same column after heating the eluate for five minutes 
at 65-C. The oligo dT column was then washed with 0.4 ml 
of high salt loading buffer consisting of 10 mM Tris-HCl 
at P H 7.5, 500 mM sodium chloride, and 1 mM EDTA. The 
oligo dT column was then washed with 2 ml of 1 X low salt 
buffer consisting of 10 mM Tris-HCl at P H 7.5, 100 mM 
sodium chloride, and 1 mM EDTA. The messenger RNA was 
duted from the oligo dT column with 0.6 ml of buffer 
consisting of 10 mM Tris-HCl at pH 7.5, and ImM EDTA. The 
messenger RNA was purified by extracting this solution 
with phenol/chloroform followed by a single extraction 
with 100% chloroform. The messenger RNA was concentrated 
by ethanol precipitation and re-suspended in DEPC H^O. 

The messenger RNA isolated by the above process 
contains a plurality of different V H and V L coding poly- 
nucleotides, i.e., greater than about 10* different V H - 

and V,-coding genes . 

isolated RNA was converted to cDNA by a primer exten- 
sion reaction with a first-strand synthesis kit acc rding 
t manufacturer's instructions (Stratagene) by using an 
oligo (dT) primer for the light chain and a specific 
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primer, C H 1', for the heavy chain. Mullinax et al., s^pxa. 
in a typical 50 M l transcription reaction, 5 ug of PBL 
mRNA in water was first hybridized (annealed) with 200 ng 
(50.0 pmol) of an oligo (dT) primer for the light chain. 
5 in a separate reaction, 5 ug of PBL mRNA in water was 
first hybridized (annealed with 200 ng (20 pmol) of the 
heavy' chain primer, C H 1', at 65-C for five minutes 
Subsequently, the- mixture was adjusted to 0,5 aM-each of 
dATP, dCTP, dGTP and dTTP, 50 mM Tris-HCl at pH 8.3, 3 mM 

10 M gCl 2 75 mM KC1, 10 mM DTT, 20 units of BHase block II 
(Stratagene), and 20 units of Moloney-Murine Leukemia 
virus reverse transcriptase (Stratagene Cloning Systems) , 
was added and the solution was maintained for 1 hour at 
37 -C. PCR amplification of the heavy and light chain 

15 sequences Was done separately using 0.25-0.5 ug of first- 
strand synthesis product as template with sets of primer 
pairs using Tag DNA polymerase as described in Example 23. 

The PCR amplified light chain DNA fragments were then 
digested with sac I and Xba I and ligated into a modified 

20 Lambda Zap II vector as prepared in Example 29 to form a 
light chain immunoZap Library (ImmunoZAP L; Stratacyte, La 
Jblla, California) . The PCR amplified heavy chain DNA was 
digested with Spe I and Sho I and ligated into a different 
modified Lambda Zap II vector as prepared, in Example 27 to 

25 form a heavy chain ImmunoZap Library (ImmunoZAP H; 
Stratacyte). The resulting libraries were amplified and 
the resulting DNA was packaged into bacteriophage with in 
vi^ES packaging extract, Gigapack II gold (Stratagene) and 
used to infect E. co U strain XLl-Blue (Stratagene). 

30 to construct a library for coexpression, the right 

art of the heavy , chain library phage DNA was digested with 
Hind III, preserving the left arm of ImmunoZAP H with a 
heavy chain inserts. The left arm of the light chain 
library phage DNA was digested with Mlu I resulting in a 

35 right arm of ImmunoZAP with kappa light chain inserts. 
Both products were then digested with EcoRI and ligated 
to create a combinat rial library that encoded human Fab 
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fragments including those specific for TT. Mullinax, et 
al. , supra . 

Reactive plaques were first identified by binding to 
tetanus toxoid as described in Example 31. Bacteriophage 
5 from purified reactive plagues were then -converted to the' 
plasmid format by in vivo excision with R408 helper phage 
(Stratagene) following methods described in Example 31 and 
familiar to one skilled in the art . Short, et al . , Easl^. 
^cidsTRes. . 16:7583^7600 (1988) . The resulting purified 
10 plasmid DNA encoding heavy and light chain was then used 
in PCR reactions as described below in Example 23. 

B. Preparation of a v„- and V L -Codino Repertoire, fron 
mRNA from Tissues and Cells 

(i) Human 

15 Purified populations of PBLs, other lymphocytes ; and 

hybridomas which express immunoglobulins including XgG, 
IgM/ IgE, IgD, and IgA are used as sources for isolating 
mRNA encoding immunoglobulins. PBL's and other immuno- 
globulin expressing lymphocytes are isolated from either 

20 spleen, lymphoid tissue or plasma. Following purification 
of the cells, total RNA is then purified from these cells 
using a RNA isolation kit (Stratagene) as described in 
Example 22a. The purified RNA is then converted to cDNA 
with a first-strand synthesis kit as described in Example 

25 22a. The resultant cDNA is then used as a template in PCR 
amplification reactions as described below in Example 23 
for the production of dicistronic molecules expressing 
heavy and light chains. 

(ii) Mouse 

30 Populations of cells described above can be isolated 

from other mammalian sources 6uch as mouse or rabbit. 
Both mRNA and rearranged DNA can be isolated as described 
above .and used as templates in PCR amplificati n reac- 
tions. cDNA synthesized from mRNA is lated from a mous 

35 anti-human fibronectin hybridoma (ATCC, CRL-1606) was used 
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as a preferred template for th production of dicistronic 
molecules expressing heavy and light chain. 


c. 


rpnaraticr * ^^Codil 


5 


Rearranged DNA isolated from PBLs, other lymphocytes, 
and hybridomas which express immunoglobulins can be used 

TroLL for preparing a V H -coding repertoire using 
rearranged DNA is performed as described in Example 23. 


cioned DNA, prepared in ***** " «** * 
torlal library that encodes human Fab fragments which bind 
tetanus toxoid <TT) , was used as a template ^ preparing 
1S a v.-coding double etrMded DNA homolog. Human heavy 
chain, containing both th. V, and C.1 coding region and 
aesignated as Fd. was amplified in a PC* reaction. J» 
amplification was performed in a 100 ul react ion, =ontain- 
i„, S nanograms (ng) of the cloned DNA in PC* buffer 
20 consisting of the following: 10 m» Tris-HCl. PH 8 3, 50 
mK KCl. l. 6 mMH,cl„ 0.001* (w/v, gelatin, 200 m* ^each 
oOTP, 200 nanomolar (n«) of each primer, and^.5 units .of 
Tag DNA polymerase. The human V, outside primer and 
inside primer were used as a PCR primer pair for emplifi- 
25 cation of the heavy chain (table 11 and Figure 21, The 

reaction mixture was overlaid with mineral oxl and sub- 
jected to 40 cycles of amplification. "^"""f^ 
Lie (thermocycle) involved denaturatxon at 94-c for.1.5 
lutes, annealing at 54-C for 
30 nucleotide synthesis by primer extension »* 
,2-C for 3.0 minutes followed by a return to the denatur- 
ation temp rature. The resultant amplified v,-coding DNA 
homolog containing «mpl«s were then g 1 purified, 
extracted twice with phenol/chl reform, once with chloro- 
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form followed by ethanol precipitation and were stored at 
-70*C in 10 mM Tris-HCl f pH 7.5, and 1 mM EDTA. 

To verify the amplification of the heavy chain, the 
PGR purified products were electrophoresed in an agarose 
5 gel. The expected size of the heavy chain was approxi- 
mately 730 base pairs as shown in Figure 23. The V H - 
coding double stranded DNA homologs were then used in 
subsequence PGR— amplification react ions„with_V L r„coding 
counterparts prepared below for the production of dicis- 
10 tronic DNA molecules having V„ and V t cistronic portions as 
illustrated in Example 24. 

B. V L -Codinq Double stranded nNA Homoloas 

Cloned DNA, prepared in Example 22 from a combina- 
torial library that encodes human Fab fragments which bind 

15 tetanus toxoid (TT) , was used as a template for preparing 
a V L -coding double stranded DNA homolog. Human light 
chain, containing the entire coding region of kappa light 
chain (V L and CJ , was amplified using the same PCR condi- 
tions described for human heavy chain with the exception 

20 that a human V L inside primer and C^ 1 outside primer were 
used as the PCR primer pair (Table 11 and Figure 21) . The 
resultant V L -coding double stranded DNA homolog was gel 
purified and stored as described above. 

To verify the amplification of the light chain, the 

25 PCR purified products were electrophoresed in an agarose 
gel. The expected size of the light chain was approxi- 
mately 690 base pairs as shown in Figure 23. The VL- 
coding double stranded DNA homologs were then used in 
subsequent PCR amplification reactions with V H -coding 

30 counterparts prepared above for the production of 
dicistronic DNA molecules as illustrated in Example 24. 
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24 . p r ^ r ^^^n of ^ rn^iiv-PriiTd mpipv™ of V„- and 
v -coding P w fl Homoloa 

a. m i- n — r-n - v ^ p ™ * olnol ° qp / 

' The V„- and V L -coding double stranded DNA homologs 
prepare 'in Examples 23A and 23B, respectively, were 
admixed together and denatured at 95'C for 5 mxnutes to 
separate the strands of each homolog. The denatured V H - 
_ and _v^coding-DH A -strands--in-the-ad B ixture_were_then 

annealed at 54 »C for 5 minutes to for. a V H - and V t -coding 
duplex DNA molecule hybridized at the 3- ends at region X 
of each original - homolog. one strand of the X region 
(cistronio) bridge encodes at least one stop codon in the 
same reading frame as the upstream cistron, a ribosome 
binding site downstream from the stop codon, and a 
polypeptide leader (pelB) having a translation initiation 
codon in the same reading frame as the downstream cistron 
located downstream from the ribosome binding site. 

B T^nsi^ ^ i™*»«*> DiH-tronic DNA Molecules 

The hybridized recombinant V r and V L -coding DNA 
molecule (internally primed duplex) was subjected to 
primer extension and then amplified with only the V and 
C • primers following the PCR reaction procedure described 
in Example 23A. This second PCR reaction is schematically 
represented in Figure 21. The PCR reaction products were 
oel electrophoresed to verify the presence of the result- 
ant V and V L -coding dicistronic DNA molecules. The 
expected size of the dicistronic molecule was about 1390 
base pairs and is shown in Figure 23. The resultant V ri - 
and V t -coding dicistronic DNA molecules were then ligated 
into the modified immunoZAP H vector (Figures 24A and 24B) 
for the construction of expression vectors as described in 
Example 30. 
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2 . Proration Mouse ^ r ^" ma v i ~ and ^'Cgdj^ . 

nouble st^nriPd pna ffmnnioqs and Production of 
p^str-onic pyt\ Molecul e j« ■ Single Amplification 
Reaction 

5 Mouse hybridoma heavy and light chain cDNA prepared 

in Example 22B was amplified in a single PCR reaction 
using the reaction conditions given above with an excess 

of the outside primers (200 nM concentration of both the 

mouse V B primer and primer) and a limiting amount of 

10 the inside primers (20 nM concentration of both the mouse 
C„l« and V L primer) (Table 11) . The resultant mouse heavy 
and light chain dicistronic molecules were then inserted 
into a modified immunoZAP H for construction of an 
expression vector as described in Example 30. 

15 26. Prp na r a tion »f Tn^mallv-Prlned pup^yes Using a 
Single Ihter-nal Prim e r Overlaps BPth the V„ and 

V L Repertoires 

Another approach to producing a library of dicis- 
tronic DNA molecules is to use a single internal primer 

20 instead of using two separately internal primers. The 
process of creating. a dicistronic molecule comprising an 
upstream V H cistron and a downstream V L cistron is to 
combine in a PCR buffer the following: a repertoire of V„ 
genes consisting of at least 10 s different genes; a reper- 

2S toire of V L genes consisting of at least 10 4 different 
genes; an outside V„ primer; an outside V L ; and a poly- 
nucleotide strand having a 3 '-terminal priming portion, a 
cistronic bridge coding portion, and a 5 • terminal primer- 
template portion. The PGR reaction is performed as 

30 described in Example 22A. 

The 3 '-terminal priming portion of a polynucleotide 
strand (linker) has a nucleotide base sequence homologous 
to a p rtion of the primer extension product of one of the 
outside primers. The 5 '-terminal priming portion encodes 

35 a nucleotide base sequence homologous to a portion f the 
primer extension product of the other outside primer. The 
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cistronic bridge coding portion encodes at least one stop 
codon in the sa»e reading £ra»e as the 
. ribosome binding site downstream tarn the stop codon and 
a polypeptide leader (pelB) having a translation initia- 
tion ZL in the sa»e reading rra*e as the ^re» 
cistron where the initiation codon is located ^re» 
trom the riboso-e binding site. Polynucleotide strand 


15 


20 


Table 12. 
10 fa**! e 12 
Scq. 

x . 5 . GGAGAGTGGGTCATCACGAGCTCAGCCATGGCAGGTTGG 
GCAGCGATTAATAACAATCCAGCGGCTGCCGTAGGCAAT 
. AGGTATTTCATTATGACTGTCTCCTTGCTATTAACTAGT 

ACAAGATTTGGGCTC 3' 
#1261* 2' 5' GAGCCCAAATCTTGTACTAGTTAATAGCAAGGAGACAGT 
CATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 
GTTATTAATCGCTGCCCAACCTGCCATGGCTGAGCTCGT 

GATGACCCACTCTCC 3' 

1 Primes «RNA (sense strand) of heavy chain y 
region; antisense strand of light chain V t with dicistronic 
bridge in between heavy and light chains will be in the 
same relative orientation as given in 

2 Primes antisense strand of heavy chain C H 1 regions, 
and sense strand of light chain V t region with dicistronic 
in between heavy and light chains will be an the same 
relative orientation as given in the example. 

The resultant single step internally primed 
dicistronic DNA molecule can then be ligated into modified 
Imm unoZAP H for construction of an expression vector as 
described in Example 30. 


25 
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27. PreparatloTi T^mbd? ? f *r TT Expression Vector 

The vector Lambda Zap™ II (Stratagene) is 
derivative of the original Lambda Zap (ATCC # 40,298) that 
maintains all of the characteristics of the original 
5 Lambda Zap including 6 unique cloning sites, fusion 
protein expression, and the ability to rapidly excise the 
insert i n the form of a phagemid (Bluscript SK-) , but 
lacks the SAM 100 mutation, allowing growth oh many Non- 
Sup F strains, including XLl-Blue. The Lambda Zap II was 

10 constructed as described in Short et al., Nucleic Acfrds 
Res. . 16:7583-7600, (1988) , by replacing the Lambda S gene 
contained in a 4254 base pair (bp) DNA fragment produced 
by digesting Lambda Zap with the restriction enzyme Ncol. 
This 4254 bp DNA fragment was replaced with the 4254 bp 

15 DNA fragment containing the Lambda S gene isolated from 
Lambda gtlO (ATCC #. 40,179) after digesting the vector 
with the restriction enzyme Ncol. The 4254 bp DNA frag- 
ment isolated from lambda gtlO was ligated into the 
original Lambda Zap vector using T4 DNA ligase and 

20 standard protocols for such procedures described in 
nirrent Protoeols in ar Biology., Ausubel et al., 

eds. , John Wiley and Sons, NY, 1987. 

28. Preparation of y [r E ypr*>«s< on Vectors t . Immuno3AP H and 

Modified JmnunoZAP P, Obstruction 

25 A. ImmunoZAP H 

The main criterion used in choosing a vector system 
was the necessity of generating the largest number of Fab 
fragments which could be screened directly. Bacteriophage 
lambda was selected as the expression vector for three 

30 reasons. First, in vitro packaging of phage DNA is the 
most efficient method of reintroducing DNA into host 
cells. Second, it is possible to detect protein expres- 
sion at the level of single phage plaques. Finally, the 
screening of phage libraries typically involv less diffi- 

35 culty with nonspecific binding. The alternative, plasmid 
cloning vect rs f are only advantageous in the analysis f 
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clones after they have been identified. This advantage is 
not lost in the present system because of the use of 
iambda Zap, thereby permitting a plasmid containing the 
heavy chain .light chain, or Fab expressing inserts to be 
excised. 

To express the plurality of V H -coding DNA homologs- in 
an JU-SSli host cell, a vector was ^t^cte^^ placed 


ail Ct COAA «WO w ww*^ 9 ~ 

provided a ribosome binding site as described by . Shine et 
al., Nature . 254:34, (1975), provided a leader ^sequence 
directing the expressed protein to the periplastic space, 
provided a polynucleotide sequence that coded for a known 
epitope (epitope tag) and also provided a polynucleotide 
^at coded for a spacer protein between the ^-coding DNA 
homolog and the polynucleotide coding for the epitope tag. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 
single stranded polynucleotide segments of 20-40 ^bases 
that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 25A. The 
individual single-stranded polynucleotides (N,-N 12 ) are 
shown in Table 13 below. 


n^frle 13 
Seq. 
25 J A - Wo ' 

(91) Nl) 5 

(92) N2) 5 

(93) N3) 5 

(94) N4) 5 
30 (95) N5) 5 

(96) N6) 5 

(97) N7) 5 


GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3 ' 
AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 
GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3 ' 
. » » r«T^r~T>nn*. n a aTTCTAGACTAGGTTAAT, 


3' 


AGG' 
TO 


'ATTACTCGCTGCCCAACCAGCCATGetcu • j 
(GTGAAACTGCTCX3AGAATTCTAGACTAGGTTAATAG 3 • 
JGACTATTAACTAGTCTAGAATTCTCGAG 3 « 
CAGTTTCACCTGGGCCATGGCTGGTTGGG 3' 
CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3 
,o 8 ) N8) 5« GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC 3 « 

99) N9-4) 5 • AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 

100) Nil) 5' GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3 • 
(101) H12) 5' TCGACGAATTCTATTAAGAACOGTAGTC 3" 
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(102) N10-5) 5' CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3- 
Polynucleotide N2, N3, N9-4', Nil, N10-5-, N6, N7 and 
N8 were kinased by adding 1 Ml of each polynucleotide (0.1 
ug/MD and 20 units of T, polynucleotide kinase to a solu- 
5 tion containing 70 mM Tris-HCl at P H 7.6, 10 mM MgCl 2 , 5 mM 
DTT 10 mM beta mercaptoethanol , 500 ug/ml of BSA. The 
solution was maintained at 37'C for 30 minutes^ andjhe_ 

" reaction stopped by maintaining the solution at 65-C for 

10 minutes. The two end polynucleotides, 20 ng, of poly- 
10 nucleotides Nl and polynucleotides N12, were added to the 
above kinasing reaction solution together with 1/10 volume 
of a solution containing 20 mM Tris-HCl, P H 7.4, 2 mM MgCl 2 
and 50 mM NaCl. This solution was heated to 70'C for 5 
minutes and allowed to cool to room temperature, approxi- 
mately 25-C, over 1.5 hours in a 500 ml beaker of water. 
During this time period all 10 polynucleotides annealed to 
form the double stranded synthetic DNA insert shown in 
Figure 25A. The individual polynucleotides were 
covalently linked to each other to stabilize the synthetic 
DN A insert by adding 40 Ml of the above reaction to a 
solution containing 50 mM Tris-HCl, pH 7.5, 7 mM MgCl 2 , : 
1 mK DTT, 1 mM ATP and 10 units of T4 DNA ligase. This 
solution was maintained at 37'C for 30 minutes and then 
the T4 DNA ligase was inactivated by maintaining the 
solution at 65'C for 10 minutes. The end polynucleotides 
were kinased by mixing 52 Ml of the above reaction, 4 Ml 
of a solution containing 10 mM ATP and 5 units of T4 
polynucleotide kinase. This solution was maintained at 
37-C for 30 minutes and then the T4 polynucleotide kinase 
was inactivated by maintaining the solution at 65 'C for 10 
minutes. 

The completed synthetic DNA insert was ligated 
directly into a lambda Zap II vector prepared in Example 
27 that had been previously digested with the restriction 
35 enzymes NotI and Xhol. The ligation mixture was packaged 
according to th manufacturer's instructions using 
Gigapack II <3 Id packing extract (Stratagene) . The pack- 


20 


25 


30 
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a ,ed ligation fixture was plated on XLl-blue cells 
: (Latagene, . Individual Laabd. Sap II pU*~! were cored 
and the inserted excised according to the in^ excision 
protocol provided by the -anufacturer (Stratagene) . This 

s Loss -"i-*- s"* 0001 eonverts tte cl0M * ! 11» 

Sei^a zap II vector into . plas»id vector to allow 

rnfry ...nl pulatlon and seq u encing. The accuracy o t the 

above cloning steps was confirmed by sequencing the insert 
using the Sanger dideoxy »ethod described in by Sanger et 
10 ^"V- rn r 74:5463-5.67, (1977, and 

10 u 'in, the Manufacturer's instructions in the AHV averse 
Transcriptase *S-ATP seguencing Kit ' ^ 

sequence of the resulting V, expression vector is shown in 

Figure 2SA and Figure .26. 

15 B. y[™^^ ed itmminoSAP H _ 

To create a f usion-PCR library from hybridoma RNA for 
expressing the plurality of V H -coding DNA 
coL host cell, a vector based on the ImmunoZAP H vector 
described above was constructed. The procedure for con- 

20 structing the vector was performed as described above wi£ 
the following modif ication,: elimination of the sacl site 
between the T 3 polymerase and HotI sites and changing the 
nucleotide base residue sequence from AAA to CAG which 
resulted in an amino acid residue change from lysine to 

25 glutamine as shown in Figures 24A and 24B. 

The individual single-stranded polynucleotides <N„ 
Ni , n 6 and N 7 , , which were modified from their counterparts 
listed in Table 14, are listed in Table 14 below. 

*«)•>;) e 14 
30 Seq. 

^12^1) 5- AGCTGCGGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3« 
128) N2) 5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3« 
M29) N3) 5' GTTATTACTCGCTGGCCAACCAGCCATGGCCC 3- 
35 130 N4) 5- AGGTGCAGCTGCTCGAGAATTCTAGACTAGGTTAATAG 3' 
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(131) N5) 5 • TCGACTATTAACTAGTCTAGAATTCTCGAG 3» 

(132) N6) 5« CAGCTGCACCTGGGCCATGGCTGGTTGGG 3' 

(133) N7) 5« CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3« 

(134) N8) 5' CTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGCGGCCGC 
5 3* 

(135) N9-4) 5« AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3 • 

(136) Nil) 5 • GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3 1 

(137) N12) 5 • TCGACGAATOC^TTAAGAACCGTAGTC. 3_' — — 

(138) N10-5) 5 « CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3 « 

10 The modified ImmunoZAP H vector was created to 

eliminate an unnecessary Sad site in the ImmunoZAP H 
vector, (Example 28a, when the heavy and light chain 
vectors were combined. The modifications also improved 
the efficiency of secretion of positively changed amino 

15 acids in the amino terminus of the expressed protein, 
inouye et al., p™«. Watl. ^rt. Bci. USA. 85:7685-7689 
(1988) . 

29. Preparation of V L Expre ss i on Victor- TmmunoZAP L 
Construction 

20 To express the plurality of V L coding polynucleotides 

in an E. coli host cell, a vector was constructed that 
placed the V L coding polynucleotide in the proper reading 
frame, provided a ribosome binding site as described by 
Shine et al.. Nature . 254:34, (1975), provided a leader 

25 sequence directing the expressed protein to the peri- 
plasmic space and also provided a polynucleotide that 
coded for a spacer protein between the V L polynucleotide. 
A synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by designing 

30 single stranded polynucleotide segments of 20-40 bases 
that would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 25B. The 
individual single-stranded polynucleotides (N,-N 8 ) are 
shown in Table 13 above. 

35 Polynucleotides N2, N3, N4, N6, N7 and N8 w re 

xinased by adding 1 #tl of each polynucle tide and 20 units 
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10 


15 


of T» polynucleotide kinase to a solution containing 70 mM 
Tris-HCl/ P H 7.6, 10 DM MgCl 2 , 5 mM DDT, 10 mM 2MB, 500 
micrograms per ml of BSA. The solution was maintained at 
37-C for 30 minutes and the reaction stopped by maintain- 
ing the solution at 65'C for 10 minutes. The two end 
polynucleotides 20 ng of polynucleotides Nl and poly- 
nucleotides N5 were added to the above kinasing reaction 
solution together with l/io volume-of^solution-contain- 

ing 20 mM Tris-HCl, P H 7.4, 2 mM MgCl 2 and 50 mil Nad. 
This solution was heated to 70'C for 5 minutes and allowed 
to cool to room temperature, approximately 25«C, over 1.5 
hours in a 500 ml beaker of water. During this time 
period all of the polynucleotides annealed to form the 
double stranded synthetic DNA insert. The individual 
polynucleotides were covalehtly linked to each other to 
stabilize the synthetic DNA insert with adding 40 Ml of 
the above reaction to a solution containing 50 a* Tris- 
HC1, P H 7.5, 7 mM MgCl 2 1 mM DTT, 1 mM ATP and 10 units of 
T4 DNA ligase. This solution was maintained at 37'C for 


T4 DNA ligase 

20 


T4 UNA liyasB. ■ - ■ . 

30 minutes and then the T4 DNA ligase was inactivated by 
maintaining the solution at 65'C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 /il of the above 
reaction, 4 *a of a solution recontaining 10 mM ATP and 5 
units of T4 polynucleotide kinase. This solution was 
25 maintained at 37 «C for 30 minutes and then the T4 poly- 
nucleotide kinase was inactivated by maintaining the 
solution at 65 *C for 10 minutes. 

The completed synthetic DNA insert was ligated 
directly into a Lambda Zap II vector prepared in Example 
30 27 that had been previously digested with the restriction 
enzymes NotI and Xhol. The ligation mixture was packaged 
according to the manufacturer's instructions using 
Gigapack II Gold packing extract and the packaged ligation 
mixture was plated on XLl-Blu cells as described in 
35 Exampl 28A. Individual lambda Zap II plaques were cored 
and the inserts excised according to the An_vovo excision 
prtocol as describ d in Example 28A. This in vj,vo , 


SUBSTITUTE 


SHEET 


WO 91/16427 


PCT/US91/02910 


146 


excision protocol converts the cloned insert from the 
Lambda Zap II vector into a phagemid vector to allow easy 
manipulation and sequencing and also produces the phagemid 
version of the V L expression vectors. The accuracy of the 
5 above cloning steps was confirmed by sequencing the insert 
using the Sanger dideoxy method described by Sanger et 
Wa +1. A Cad : rnri-. PSA." 7 4:5463-5467, (1977) and 
using the manufacturer's instructions in the AMV reverse 
transcriptase 3S S-dATP sequencing kit (Stratagene) . The 

10 sequence of the resultin V t expression vector is shown in 
Figure 25B and Figure 27) . 

The V L expression vector used to construct the V L 
library was the phagemid produced to allow the DNA of the 
V expression vector to be determined. The phagemid was 

15 produced, as detailed above, by the in vivo excision pro- 
cess from the Lambda Zap V L expression vector (Figure 27) . 

30. ffonstTTictien of v ^ -fl vp r*"* 10 " Vectors and Library 
A . T.ieratton of Dlclstr ™ ^ nNA Molecules with Modified 
-TmminoZAP H 

20 In preparation for cloning a library enriched in V H - 

V L -coding (V ML ) dicistronic DNA molecules, PCR amplified 
products (human or mouse) prepared in Examples 24, 25, and 

* 26 (50 mM NaCl, 25 mM Tris-HCl, pH 7.7, 10 mM MgCl 2 , 10 mM 
p-mercaptoethanol, .100 ug/ml BSA, at 37 'C were digested 

25 with restriction enzymes Xhol and Xbal at a concentration 
of 60 units of enzyme per ug of DNA, and purified on a 1% 
agarose gel. After gel electrophoresis of the digested 
PCR amplified dicistronic DNA molecules, the region of the 
gel containing the DNA fragments of approximately 1360 

30 base pairs in size was excised, purified using Gene-Clean 
(BIO 101, La Jplla, California) , ethanol precipitated and 
resuspended in 10 mM Tris-HCl, pH 7.5, and 1 mM EDTA to a 
final concentration of 10 ng/ul. Equimolar amounts of the 
insert were then ligated overnight at 4'C to 1 ug f 

35 modified immunoZAP H vector, prepared in Example 28b, 
(Stratagene) previously digested with Xhol and Xbal. A 
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rt ion of the ligation mixture (1 ul) was packaged for 2 
portion ot me ^ r 4 aana ck Gold packaging 

hours at room temperature using GigapacK to p * * 

extract (Stratagene, and the packaged ™ 

ofa peJissive E^li (strain XLl-blue, lawn to 9»«U 

? JL The library was determined to consist of predom- 
plaques. The library _ non _ re combinant background, 

inantly V HL with less than 5% non recoup 


icre 



Screen for expression of V Hl dlcistronic molecules, 
io E<Ji «re infected to yield approximately 100 plaques 
^Sate Replica filter lifts of the plaques on an agar 
c la« were priced by overlap a nitrocellulose filter 
had been soaKed in 10 mH isopropyl beta- 
I^niogalactopyranoside on each plate with transfer for 

(capper — „„„-fH*-med the presence and 

20 tion of immunoreactive product confirmed p. 

^ caressing antibody that 

hound TT, plaques were ^^J^JS^." 

Ascribed above. Replica filters were inc 
5S ^-tetanus toxoid and washed. Positive plaques were 
Identified by autoradiography and isolated. The frequency 
o r,»sitive clones in the library was egu.valent to 
^uler of positive clones,/(nunber of plaques screened, 
X faction of plaques expressing V Bl) . Concentrated non- 
,0 adsorbed tetanus toxoid was iodinated with sodium iodide 
" - I ( 1CH. Xrvine. California, by the Choramine-T ^ as 

described in Botton et al... fiochea- I *. S39 
describee, in . dodo-Beads, Pierce, 

(1973) and available in a Jclt li°° 

Kocttord, r re-plated at approximately 100 

55 Human DNA clones were p* . 

phag per plagu side by side with the parental phage that 
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were used as templates for PGR amplification and screened 
in the primary antigen binding screen. The results of the 
screening procedure are seen in Figure 28. Similar 
signals between the parental clones and the V EL dicistronic 
DNA molecules demonstrated that the sequence differences 
introduced with the q,l« and V L primers did not adversely 
affect gene expression. Also, it should be noted xn 
Figure 28 that a random parental clonejthat didjipt^eact 
-with tetanus^o^idTT^ «** and after 


10 


15 


the PC* dicistronic fusion, as was the control ImmunoZAP 
H vector (IZ H) . 

(ii) Mouse • . „ 

Mouse antibody-producing plaques prepared in Example 
27 were screened for antibody expression with rabbit anti- 
Bouse heavy and light chain antibody (Cappel Laboratories) 
as described above. 


25 


30 


31. 
A. 


r>, ayae t er i za t1"" o-f gion^ Piristronic V„ L P*pertoj,re 
?n Expres s <on library 

v»Wfi Ca ti o ~ of Pre— »" d size of Si°J^ 

20 fHMstronic V,„ Kepertoire 

Bacteriophage from purified reactive plaques prepared 
in Example 30B were converted to the plasmid format by in 
vivo excision with R408 helper phage according to manufac- 
turer's protocol (Stratagerie) and also described in Short 
et al., A C jds Res. . 16:7583-7600 (1988). In the in 

vivo excision protocol, the cloned insert from the 
ImmunoZAP H vector was converted into a phagemid vector to 
allow easy manipulation and sequencing. Briefly, phage 
plaques were cored from the agar plates and transferred to 
sterile microfuge tubes containing 500 ul of a buffer 
containing 50 mM Tris-HCl, pH 7.5, 100 mM NaCl, 10 mM 
MgS 0 4 , and 0.01% (w/v) gelatin and 20 ul of chloroform. 

For excisions, 200 ul of th phage stock, 200 ul of 
XLi-Blue cells (A^ - 1.00) and 1 ul of R408 helper phage 
(1 x 10 10 plaque forming units Opfu)/ml) were incubated at 


35 
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,,-c for 15 minutes. AFter a « hour incubation in turia- 

* «•»■ 70«c for 20 minutes to 

Bertani (LB) broth and heating at 70 C for 

neat kill the XLl-blue cells, the phagemxds were , » 
Lfected into XLl-Blue cell, and plated onto LB plates 
containing a^picillin. Double stranded DHA -prepared 
from the phagemid containing cells 

ods described by Holmes et al., OTl ■ HocW, , l«r"». 

oas ues. screened for DMA inserts by 
nosil. clones were first screened ^ 

^^^^ 

10 1390 base pair fragment on an agarose ge* 
presence of a % dicistronic molecule insert. 

Clones containing the putative V, insert were 
fenced using reverse transcriptase 
15 general method described by Sanger et al., ma^JistU 

Edifications 0, this method provided in the manufac- 
turer •. instructions in the AM* reverse transcriptase s- 
oXTP. sequencing kit (Stratagene) ^ ^ ^ ^ 

,n nucleotide sequence analysis of eevew » 

indicated ^ the sequence of the fusion region was 
identical to that shown in Figure 22, proving that the 
"cnes were actually generated through a fusion PC* 
intermediate . 

25 c. ^vantages 

Molecules . 

PCR amplification can, therefore, be used to fuse 
sequences responsible for encoding subunits of a hetero- 
dimeric protein together into a single DHA fragment that 
30 can «en direct the expression of both subunits fro- one 
egression vector. In the case of "J** 
X:e of nucleic acid template comes from hybridom. BKHA, 
«. r is only on heavy and light chain sequence to choose 
f ro», and thus the heavy: light pair is a -natural- pair. 
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However, if spleen, peripheral blood B-cell, or other 
lymphocyte mRNA is used as the source of template, the PCR 
fusion reaction to form a dicistronic DNA molecule can 
randomly pair heavy and light chains from different cells, 
5 producing a combinatorial library. In such a library, 
only a small fraction of the clones contain the original 
heavy and light chain pairs. This may not be a problem if 

the desired natural pair is well represented^ntte^orig- 

inal B-cell population, as is the case with hyperimmunized 
10 donors. However, if one wishes to find a naturally 
occurring rare specificity in a combinatorial library, one 
may have to screen a large number of clones. 

The fusion method presented here may offer a solution 
to the random combinatorial problem. . If one begins with 
15 a very dilute population of B-cells (possibly in a medium 
that limits diffusion), it may be possible for the dicis- 
tronic event to occur between naturally paired heavy and 
light chain sequences before significant mixing between B- 
cell BNA occurs. Thus, the fused heavy and light .chain 
20 sequences would be the original pairs, and the resulting 
library would express predominantly the naturally occur- 
ring antibody specificities. Such a library would be 
highly preferable when rare natural specificities are 
sought. 

25 Another advantage to this method is that only one 

vector and one cloning step are necessary. This saves a 
substantial amount of time, resources, and effort. 
Moreover, the ease of the single PCR reaction greatly 
simplified the process of going from B-cell RNA to an 

30 coli library, making this approach a noteworthy alterna- 
tive to standard hybridoma technology. 

The foregoing is intended as illustrative of the 
present invention but not limiting, numerous variations 
and modifications can be effected without departing from 

35 the true spirit and scope of the invention. 
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claims: 


1. A method of producing a nucleic acid vector 
encoding tvo or more desired genes, each from a family of 
aenes, said genes being capable of together producing a 
Lracteristic that can be used to identify the vector 
encoding said desired genes from other vectors encoding 
other combinations of genes from said families of genes, 
vhich-method_cpmprises: - - — ■ — — — 

a) randomly inserting into vectors one member from 
10 a first family of genes and one member from one or more 
other families of genes so that a population of vectors 
are created wherein each vector may contain one of the 
genes from said first gene family and one of the genes 
from each of said other gene families ; 
15 b) identifying within said population of vectors a 

vector capable of detectably producing a desired charac- 
teristic resulting from the inclusion of one gene from 
said first gene family and one gene from each of said 
other gene families, and using said characteristic to 
20 distinguish the vector from, other vectors within the 
population containing undesired combinations of gene 
members from said gene families. 

2. Otoe method of claim 1 wherein said genes are 
-inserted into a DNA vector at one or more integration 

25 sites, which method further comprises: 

a) preparing said vectors with one or more site- or 
region-specific recombination sequences; 

; b) permitting, in the presence of one or more 
reagents facilitating said site- or region-specific 
30 recombination, a member of said first family of genes to 
combine in a vector with a member of said second family of 
genes. • 

3 The method of claim 2 wher in said site- or 
region-specific recombination site is recognized and acted 
35 on by flp r combinase. 
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4. The method of claim 2 wherein said site- or 
region-specific recombination site is recognized and acted 
on by ere recombinase. 

5. The method of claim 2 wherein said site- or 
region-specific recombination site is recognized and acted 

-on-by-lambda-integrase recombinase . ; ■■ 


10 


15 


6. The method of claim 2 wherein at least one of 
the vectors contains a sequence capable of being 
recognized and acted on by transposase. 

7. The method in claim 1 where said genes are 
inserted into a DNA vector at one or more integration 
sites, which method further comprises: 

a) . cleaving said vector with one or more site- 
specific integration reagents; 

b) preparing the ends of genes from said first 
family of genes so that one end will ligate with an end of 
the vector cleaved by a first reagent and the other with 
an end of the vector cleaved by a second reagent; 

c) preparing the ends of said genes from said other 

gene families so that one end will ligate with an end of 
the vector cleaved by a third reagent and the other with 
an end of the vector cleaved by a fourth reagent ;- 

d) preparing at least one double stranded DNA 
linker fragment having one end ligatable to one end of 
said genes from said first family of genes^ and the other 
end ligatable to one end of genes from said other family 
of genes; 

e) mixing said vector, genes, and said linker 
fragment or fragments together in a ligation mix and 

30 ligating the components. 

6. The method of claim 7 wherein said reagents are 
the same. 


20 


25 
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the method of claim 8 wherein said reagents are 


10 


15 


20 


The method of claim 1 , wherein said combination 
of genes is accomplished in Xiyo.. 

- U. A method of producing a host cell expressing two 
-^^ore-desired^geneS,-each_f rom > «bm 

desired genes from other host cells expressing o 
Stions 6f genes from said families of genes, which 
method comprises, ^ ^ e 

«om a first family of genes and one member « rem one « 
»ore other families 6f genes so that a population of host 
::"s ^ created wherein each host cell ^-ntain one 
Z the genes from said first gen. family end one of the 

„ host cell capable of d.tectably exhibiting a desired 
chaotic 'resulting from the inclusion of one gen. 
£t om said first gene family end one gene from each of said 
o«L gene families, and using said characteristic to 
dist^st L host cell from other host c.U* "ithin^e 
p OPU la«on containing undesired combinations of gene 
members from said gene families. 

12 . The method of claim 11 wherein said vectors are 
Xambda bacteriophage vectors and the host cells are E. 
coli. 


30 


13 . A method of producing a nucleic acid vector 
^ino two or more genes belonging to families of genes, 
Tn «^le of pricing a characteristic that « M- 
us .d to identify the vector eno ding said genes from other 
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vectors encoding other members of the families of genes 
which method comprises: 

a) isolating a first population of vectors for 
which each member of said population may contain one 

5 member of a family of genes; 

b) inserting one member of a second family of genes 
into each of the vectors so that a population of vectors 

, ar e created where each vector may contain one of the gen es 

from said first family and one of the genes from said 

10 second family; 

c) identifying within said population of vectors a 
vector capable of producing a characteristic resulting 
from the inclusion of one gene from said first gene family 
and one gene from said second gene family, and using said 

15 characteristic to distinguish the vector from other 
vectors within the population containing other members of 
the gene families. 

14 i A method of producing a nucleic acid vector 
encoding two or more genes belonging to families of genes, 
20 said genes being capable of producing a characteristic 
that can be used to identify the vector encoding said 
genes from other vectors encoding other members of the 
families of genes, which method comprises: 

a) isolating a first population of vectors, for 
25 which each member of said population may contain one 

member of a first family of genes and a nucleic acid site 
or region at which the population of vectors can be 
combined with a second population of vectors; 

b) isolating a second population of vectors, for 
30 which each member of said population may contain one 

member of a second family of genes and a nucleic acid site 
or region at which the second population of vectors can be 
recombined with said first population of vectors so that 
one member of the first family of genes and one member of 
35 the second family of genes may be combined and expressed 
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in each member of a diverse population of recline. 

VSCt0 :r recombining populations of said first and second 
vectors and at said nupleio acid site or region thereby 

of which may express on. member of the 

geneS and one — « of the-con d ^ of 

- identifying _within — saia p_*v. _ 


A) identirying — _w>v"*« — =— - 

reco mbnant vectors a vecto* capable of .P~duc ng a 
onaracteristic resulting fro. the inclusion of one gene 
from each of said gene families > 

The De thod of claim 14 vherein said nucleic acid 
6 ite is' cl«ved with site-specific reagent, which method 

15 fUrth " leaving said first vector, population with said 
re89 T cleaving said second vector population with said 

"^"r ^ng both vector populations together in a 
20 ligation mix and ligating the two populations. 

16 The method of claim 14 wherein said nucleic acid 
oicn Is a homologous region capable of undergoing homo- 

^recombination. 

„ w .j_„ one or more members ox saw *■■>-'- 
inserting one t1 , nf carrvina out 

relations into a single host capable of carrying 
25 populations into * a1 i owi na such homologous 

homologous recombination and allowing 

recombination to occur. 

17 The method of claim 14 wherein said nucleic acid 

ite is a target site for site-specific recombination, 
site is a <Tie0 rtina one or more 

2 out site-specific ^ 

said nucleic acid site and allowing said sit specific 
recombinati n to occur. 
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18. The method of claim 17 wherein said target site 
for site-specific recombination is of the family of sites 
selected from flp, lox, and gamma-delta. 

19. The method of any of claims 1, 11, 13 or 14 
vherein said vectors are plasmid or cosmid vectors. 

20. The method of any of claims l f H, 13 or 1 4 

vherein said vectors are phage vectors. 

21. The method of any of claims 1, 13 or 14 wherein 
said vectors are lambda bacteriophage vectors. 

10 22. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the interaction of sequence-specific 
nucleic acids with genes from said first and second 
families of genes. 

15 23. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the hybridization of nucleic acid 
probes with genes from said first and second of. families 
of genes. 

20 24. The method of claim 14 wherein the identifica- 

tion of a particular vector within the recombinant vector 
population involves the expression of one or both of genes 
from said gene families as an SNA molecule. 

25. The method of claim 14 wherein the identifica- 
25 tion of a particular vector within the recombinant vector 
population involves the expression of one of both of genes 
from said gene families as an identifiable protein 

molecule. 
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26. The method of claim 25 wherein the protein 
molecule (s) contains a binding site for another molecule. 

27. The method of claim 26 wherein the protein 
molecule (s) contains an epitope recognized by an antibody. 

28. The method of claim 27 wherein the protein 
molecule (s) contains an immune molecule binding site for 
an epitope. 

29. The method of claims 14 wherein both genes 
express an RNA and/or polypeptide and said RNAs and/ or 

10 polypeptides physically interact within a host to create 
said characteristic. 

30. The method of claim 29 wherein both genes 
express polypeptides that physically interact , to form a 
neo-epitope recognized by an immune molecule. 

15 31. The method of claim 29 wherein both genes 

express polypeptides that physically interact to form a 
binding site for another molecule. 

32. The method of claim 31 wherein the polypeptides 
are derived from antibody genes such that the interaction 

20 of both polypeptides forms an antigen binding site. 

33. The method of any of claims 1, U, 13 or 14 
wherein the vectors contain a single promoter that 
expresses the genes from said gene families. 

34. The method of any of claims 1, 11,. 13 or 14 
25 wherein said genes from said gene families are each 

expressed from their own promoter. 

35. The method of claim 11 wherein the host is a 
mammalian cell. 
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i.i. 11 vherein th host is a 

36 . The method of claim 11 v 

eukaryotic cell. 

* 'i 11 wherein the host is a 

37 . The method of claim H « 

proXaryotic cell. 

_ clains 1, 11. 13 or 14 

_ 5 _ 38 „. The method ^^^ vo gene --fa m llies and the 

vherein there are more than ^ one ^ er 

vectors produced contein //^ eatc said characteristic. 
• of each gene family needed to cr 


39. HCFLP. 

40. LCFtP. 


4U. **w» — - 

' , biological agent having 

41 . A -ethod of producing a from 
a desired Phenotype wherein ***J^ nucleotide sequence 
Session of a particular to ^entity the 

Z vherei, said phenotype^an ^ meleotlae 
l5 biological agent having the parti 

sequence which cowrises: ^ ^ ^^tion of 

(a , bringing together ^ populations of 

nucleotide sequences, «i«vone « ^ ^ 
nucleotide sequences to pr ^ nucleoti de 

nucleotide sequence; ^ nucleotide sequences m 

(b) expressing saia w 

biological agents; and _ biologi cal agents vhich 

(c) identifying those 
25 express said desired phenotype. 

, to claim 41 vherein said 
42. A method according t biologica i agent 

phenotype can be used to gg*^ cofflbined nucleotide 
".. fr om hioological agents having o ^ phen type to 

distinguish » OBe 
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particular combined nucleotide sequence from biological 
= agents having other combined nucleotide sequences. 

43. A method according to claim 41 wherein said 
biological agent is a cell . 

5 44. A method according to claim 41 wherein said 
biological a gent is nucleic acid vector. 


45. A method according to claim 41 wherein said 
biological agent is a bacteriophage or virus. 

46. A method according to claim 41 wherein said 
10 phenotype results from expression of a hybrid polypeptide 

which is encoded by the particular combined nucleotide 
sequence and is encoded at least in part by one nucleotide 
sequence from each population of nucleotide sequences 
which was brought together. 

15 47. A method according to claim 41 wherein said 

phenotype results from expression of a plurality of 
polypeptides wherein a polypeptide is encoded at least in 
part by one nucleotide sequence from each separate 
population of nucelotide sequences which was brought 

20 together. 

48. A method according to claim 41 wherein two 
populations of nucleotide sequences are combined. 

49. A method according to claim 47 wherein said 
phenotype results from expression of a heterodimeric 

25 polypeptide wherein one subunit of said dinner is encoded 
at least in part by the nucleotide sequence from the first 
population of nucleotide sequences and the other subunit 
of said dimer is encoded at least in part by the nucleic 
sequence from the second population f nucleotide 

30 sequences. 
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50. A method according to claim 48 wherein said 
phenotype results from expression of a first polypeptide 
encoded at least in part by the nucleotide sequence from 
the first population of nucleotide sequences and of a 

5 second polypeptide encoded at least in part by the 
nucleotide sequence from the second population of 
nucleotide sequences. 

5 1. ^A^method^according— to^claim— 48— wherein— said— 
phenotype results from expression of an UNA molecule 

10 encoded at least in part by the nucleotide sequence from 
the first population of nucleotide sequences and a second 
KNA molecule encoded at least in part by the nucleotide 
sequence from the second population of nucleotide 
sequences. 

15 52. A method according to claim 48 wherein said 

phenotype results from synthesis of an ENA molecule 
encoded at least in part by the nucleotide sequence from 
the first population of nucleic acid sequences and by the 
nucleic acid sequence from the second population of 

20 nucleic acid sequences. 

53. A method according to claim 48 wherein the first 
and second populations of nucleotide sequences are 
combined by co-infection or cor-transf ormation of host 
cells. 

25 54. A method according to claim 48 wherein members 

from said first and second populations of nucleotide 
sequences are combined randomly to give combined 
nucleotide sequences. 

55. A method according to claim 41 wherein the 
30 combining of said populations of nucleotide sequences 
giv s a combined nucleotide sequence which was not 
previously expressed in said biological agent. 
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56. A method according to claim 41 wherein said, 
desired phenotype comprises a phenotype which was not 
previously expressed in a population of such biological 
agents. 

57. A method according to claim 41 wherein said 
first population of nucleotide sequences comprises non- 
identical nucleotide sequences. * 


58. A method according to claim 41 wherein each 
population of nucleotide sequences comprises non-identical 

10 nucleotide sequences. 

> 59. A method of producing a nucleic acid vector 
encoding a preselected combined nucleotide sequence which 
comprises two or more preselected nucleotide sequences, 
each independently selected, from a population of nucleo- 
15 ti de sequences, said combined nucleotide sequence being 
capable of producing a characteristic that can be used to 
identify the vector encoding said, preselected combined 
nucleic sequence comprises 

(a) bringing together a member nucleotide -sequence 
20 from each population of nucleotide sequences to .give a 

population of combined nucleotide sequences wherein each 
combined nucleotide, sequence comprises a nucleotide 
sequence from each population; 

(b) inserting into vector a member of the population 
25 combined nucleotide sequences so that a population of 

vectors is created wherein each vector may contain a 
combined nucleic acid sequence; 

(c) identifying within said population of vectors, 
a vector capable of detectably producing a desired 

30 characteristic resulting from inclusion of the preselected 
combined nucl ic acid sequence. 

60. A method according to clai* 59 wherein said 
characteristic can be used to distinguish the vector 
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encoding the preselected combined nucleotide sequence from 
other vectors encoding other combinations of nucleotide 
sequences further comprising using said characteristic to 
distinguish the vector from other different vectors within 
5 the population having unselected combined nucleotide 
sequences. 

61. A method according to claim 60 wherein said 
— nucleotide sequences are~combined randomly. ^- •"" 

62. A method according to claim 61 wherein said 
10 combined nucleotide sequences are produced using fusion 

polynucleotide amplification. 

63. A method according to claim 59 wherein said- 
combined nucleotide sequences are produced using fusion 
polynucleotide amplification. 

15 64. A method according to claim 1 wherein a 

dicistronic or multicistronic DNA sequence which comprises 
one member from the first family of genes and one member 
from one or more than families of genes which comprises a 
random combination of said members of said families of 

20 genes is synthesized using fusion polynucleotide 
amplification and inserted into vectors. 

65. A method for producing a biological agent having. 

a desired novel phenotype wherein said phenotype results 

from expression of a particular combined nucleotide 
25 sequence and wherein said phenotype can be used to 

identify the biological agent having v the particular 

combined nucleotide sequence; which comprises: 

(a) replicating at least portions of at least two 

parent nucleotide sequences under conditions that allow 
30 mutations to occur in either nucleotide sequence to 

generate a population of diverse replicas of each parent 

nucleotide sequence; 
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(b) randomly bringing together the populations of 
diverse replicas to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas; 

(c) expressing said combined nucleotide sequences in 
biological agents; and 

(d) identifying those biological agents which 

express said desired phenotype. ' 


10 


66. A method according to claim 65 wherein said 
desired phenotype is distinguishable from phenotypes 
expressed by said parent nucleotide sequences. 


67. A method according to claim 66 wherein said 
phenotype can be used to distinguish it from biological 
agents having other combined nucleotide sequences using 

15 said phenotype to distinguish those biological agents 
expressing the particular combined nucleotide sequence 
from biological agents having other combined nucleotide 
sequences . 

68. A method according to claim 65 wherein the 
20 parent nucleotide sequences comprise a single bNA molecule 

and are replicated together; further comprising separating 
the populations of diverse replicas of each parent 
nucleotide sequence prior to bringing together step (b) , 

69. A method according to claim 68 which comprises 
25 replicating two parent nucleotide sequences. 

70. A method according to claim 65 wherein the 
parent nucleotide sequences are separately replicated. 

71. A method according to claim 70 which comprises 
replicating two parent hucleotid sequences. 
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72. A method according to 71 wh rein a first parent 
nucleotide sequence is replicated in one population of 
cells and a second parent nucleotide sequence is repli- 
cated in a second population of cells and said cell 
populations are mixed and fused to generate cells which 
express combined nucleotide sequences. 

73 . A method according to claim 72 wherein 
~fi«t-parent~nucleotide sequences codesHfor a selected V, 


and said second parent nucleotide sequences codes for a 
10 selected V H , said cells are E. coli; and said combined 
nucleotide sequences express a Fab. 

74. A method for producing a biological agent having / 
a desired phenotype wherein said phenotype results from 
expression of a particular combined nucleotide sequence 

15 and wherein said phenotype can be used to identify the 
biological agent having the particular combined nucleotide 
sequence which comprises: 

(a) replicating parent populations of nucleic acid 
sequences to generate a population of diverse replicas of 

20 each parent population: 

(b) randomly bringing together the populations of 
diverse replicas to produce combined nucleotide sequences 
wherein each combined nucleotide sequence comprises one 
member of each population of diverse replicas ; 

25 (c) expressing said combined nucleotide sequences in 

biological agents; and 

(d) identifying those biological agents which 
express said desired phenotype. 

75. A method according to claim 74 wherein said 
30 desired phenotype is distinguishable from phenotypes 

expressed by said parent populations of nucleotide 
sequences. 
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76. A method according to claim 75 wherein said 
phenotype can be used to distinguish said biological agent 
from biological agents having other combined nucleotide 
sequences, further comprising using said phenotype to 
distinguish those biological agents expressing the 
particular combined nucleotide sequence from biological 
agents having other combined nucleotide sequences. 
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#L39 

TGGATCCGGCAGTT 


Subclass II (A) 
#L11 

GTAAAAnAGAGGCC 
#L03 

ctcg; 

CCTGI 

IGAAGCAGAAGCCT 


CTCGAGTCTGGAGCTnAGCTGGTAAAGCCTGGGGTT^GTGAAGATGT 
-CCTGCAAGGCTTCTGGA3ACACATT 


FIG. 5-2 
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CTCGAGTCTGGACCTGAACTGGTAAAGCCTGGGACTTCAGTGAAGATG 
TCCTGCAAGGCTTCTGGATAGACATTCACCRGCTATGTyATGCGCTGG 

GTGAAGCAGAAGCC 
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CTCGAGTCAGGGGCTGAACTGGTGAAGCCTGGGGTTTCAGTGAAGTTG 

TCCTGCAAGGCTTCTGGCTACACCTTCACllAGCTACTATATGTACTGG 
GTGAAGCAGAGGCC 

#L06 

OTCGAGTCTGGGGCTAAGCTGGTAAGGCCTGGAGCTTnAGTnAAGCTG 
^CTGnAGGGCTTCTGGCTACTGCTTCACllAnCTACTGGATGAAGTGG 

GTGAAGCAGAGGCC 
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FIG. 5-5 
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MODIFIED Vh EXPRESSION VECTOR: 


Not I RIBOSOME BINDING SITE 
5'— GAGCTCCGGCC(£AA^ 

3' mmukrnikmmmiKi 


M B LEADER 
HetLysTyrLcuUuProThrAloAlaAtaGfyLfiuUuLeuLeuAIa 
ATGAAATACC1ATTGCCTAW 

TACTnATGGATAACGGATGCWTCGGCGACCTAACAATAATGAI^ 

FIG. 24a. 

Ncol Xhol Xbol Spel 

AloGlnPnAloMetAlaGlnValGlnLeuLeuGlu 

GCCCAACCAGCCATGG()CCAGGTGttGCT 
CGGGHGmTACimmCGTimAGCTCTAAAGATCTCA 


EcoRI 

ScrTyrftoTyrAspVblPraAspTyrGlyScrStop 
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^ | RIBOSOHE BINDING SITE 
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CGTTTAAGATAAAGTTCCTCTGTGAGTAT 
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SerTyrPreTyrAspVolProAspTyrGlyScrStop 
AGTTACCCGTA()GACGnCCGG ACT 

TCAATGGGCATGCTGCAAGGC(nGATGCCAAGMTTATCnAAGCAGCT 


FIG. 25a 
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Vi EXPRESSION VECTOR: 
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